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ON THE THEORY OF SYSTEMATIC SAMPLING^ I 

By William G. Madow and Lillian H. Madow' * 

1. Introduction. It is no longer necessary to demonstrate a need for the 
theory of designing samples. Many of the policy and operating decisions of 
both government and private industry are based on samples. There has been 
an increasing tendency in government and industry to make use of sampling 
theory.* 

Unfortunately there are still considerable differences between the theory and 
practice of sampling. The origins of these differences are, on the one hand, the ig¬ 
norance of administrators concerning the practical contributions that samp¬ 
ling theory can make, and on the other, the lack of sampling theory permitting 
the evaluation of some useful sampling designs. 

]\Iuch has been and is being done towards bringing theory and practice into 
agreement.^ Administrators and samplers are each successfully educating the 
<jthers. However, there still exist sampling designs for which an adequate 
theory has not l>cen developed, even though experience indicates that if such a 
theory were developed it would demonstrate the superiority of those designs 
over others for w^hich a theory has been developed. 

Perhaps the major omission of sampling theory today is the lack of any statisti¬ 
cal meth(Kl for reaching a decision on whether to take a completely random 
sample of n elements of a population of N elements, or to take a systematic 
sample, that is, to begin with element and select elements f, i + /:, • • • , f + 
{n — l)A*, as the sample, the starting point i being chosen at random and N == 
kn approximately.*’ It is with respect to this question of whether to take a 
systematic^ or random sample that the statistician is in a dilemma because he 
has the alternatix e of recommending a systematic sampling procedure for which 
no theory exists, or a random sampling procedure that may well yield worse 

^ Bureau of Agricultural Economics and Food Distribution Administration, U. S. De¬ 
partment of Agriculture. Washington, D. C. 

* Presented at a meeting of the seminar in statistics of the Graduate School, U. S. De¬ 
partment of Agriculture, November 2, 1943. 

^ The recognition of the need for statisticians who know sampling theory has resulted 
in courses in sampling being given in some of the colleges and universities. 

^ One need only refer to the recent development of positions, the duties of which include 
giving advi(^e on sampling techniques as well as working in the field of application. 

^ In this paper we will assume that N « kn. To do away with that assumption would 
not add much in the way of generality while it would require some fairly detailed discussion. 
It may be remarked that when .V is not exactly kn, then systematic sampling procedures in 
wdiich all starting points have eipial probability of selection are biased, although the bias 
is usually trivial. If N is known this bias can be removed by sampling proportionate to 
possible size of systematic sample. 

^ As we define systematic sampling procedures, a systematic sampling procedure is a 
random sampling procedure in w’Lich many of the selections of n from N items are ex¬ 
cluded. 
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results than the systematic procedure. The purpose of this paper is to resolve 
that conflict by providing an adequate theory of systematic sampling. 

In the following sections we present the first parts of our research in the theory 
of systematic samples. Although this research covers both the theory of samp¬ 
ling single elements and sampling clusters of elements, w^e shall consider, in this 
paper, the sampling units to be single elements, not clusters of elements. The 
latter problem will be dealt with in a later paper. We shall present the theory 
of systematic sampling both from an unstratified population and a stratified 
population. Formulas for the mean value and variances of the estimates are 
derived. Comparisons with random and stratified random sampling designs 
are made. Furthermore, the estimates of the variances and formulas for “opti- 
mum^^ size and allocation of samples are derived. 

A fundamental part of the analysis is the demonstration that from a knowl¬ 
edge of the variance of the population^ and certain serial correlations or serial 
variances, can be estimated the variance of estimates based on systematic 
samples. The basic results are: 

a. if the serial correlations have a positive sum, systematic sampling is worse 
than random sampling, 

b. if the serial correlations have a sum that is approximately zero, systematic 
sampling is approximately equivalent to random sampling, and 

c. if the serial correlations have a negative sum, systematic sampling is better 
than random sampling. 

2. The use of a finite population. In this paper we assume, for the calcula¬ 
tion of the expected values, that we are sampling from a finite population of ele¬ 
ments even though the size of the population may be large enough to permit the 
use of limiting distributions. Often, this is, mathematically, a matter of choice. 
The same results would be obtained hy assuming a correctly defined multivariate 
normal distribution and using the notions of conditional probability. From a 
physical point of views how^ever, there are several factors that lead to the use of 
the finite population. We are most frequently sampling an existing population 
whose laws of transformation are either unknown or not mathematically ex¬ 
pressed.® Consequently, the notion of a normal or other specified distribution 
from which w^e sample and use conditional probability is not part of our thinking 
concerning the physical problem. On the other hand, if we consider the popula¬ 
tion to be a finite population, and use a table of random numbers to draw our 
sample from the finite population, we are using only mathematics implicit in 
our physical problem. Furthermore, we do obtain a repeatable experiment, 
that of selecting a random number, that we know is in a state of statistical 
control. 

In the usual problems of the theory of random sampling, the number of 

^ By “variance of population*' without further qualification is meant the variance of a 
random sample of one element of the population. 

< In other words, our population is not in a state of statistical control over time. 
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possible samples yielding different sample means is large enough so that the 
sample means may, with a sufficiently large size of population and sample, be 
expected to be approximately normally distributed. In systematic sampling, 
however, the number of possible sample means is usually very small and even if 
the sizes of population and sample are large, it is difficult to assume a normal 
distribution. Consequently, in our interpretation of the means and variances 
of systematic samples we are led to regard the elements of our populations as 
being the results of single observations on random variables, the distributions 
of which may vary from element to element. The interpretations that we then 
make become interpretations of conditional probability, and if the sizes of 
population and sample are sufficiently large, we can assume that the arithmetic 
mean of each of the possible sample means is normally distributed. 

The theory of systematic sampling under the assumption of an appropriate 
normal multivariate distribution will be dealt with at a later time. 

3. Definitions. Jjet the finite population to be sampled consist of N elements, 

By a sample design is meant the combination of a method of classifying these 
N elements into k classes that may or may not overlap, and a method of select¬ 
ing one of these k classes, each class having a designated probability of being 
selected. The sampling procedure associated with a given sample design is the 
operation of selecting one of the k classes according to the method stated in the 
sample design. The sample is the particular class obtained by the sampling 
procedure. 

By a random sampling procedure is meant any sampling procedure such that 
if the sampling design yields k classes then the probability of selecting anyone 
of these classes is l/k. Any sample design having a random sampling procedure 
associated with it is a random sampling design. One of the nonrandom samp¬ 
ling procedures that is being used is the procedure in which the classes have 
associated to them numbers, called sizes, and the probability of a given class 
being the sample is proportionate to its size.® Other nonrandom sampling pro¬ 
cedures are doubtless being used, 

By an unrestricted random sampling design for selecting n elements from N 
elements is meant the sampling design such that there are Cn classes, the possible 
selections of n from N elements, each having a probability of 1/Cn of being the 
sample. The associated random sampling procedure might consist in identify¬ 
ing each class by a number t, i = 1, * • * , Cn and selecting a number i from a 
table of random numbers. The random sampling procedure might also consist 
in identifying the N elements with numbers j — 1, • • • , A^, and then selecting 
a number j from a table of random numbers, then selecting a different number j 
from a table of random numbers, and following that procedure until n numbers 

• For a discussion of this problem see the paper entitled, *‘On the theory of sampling 
from finite populations,*^ by Morris H. Hansen and William N. Hurwitz, Annals of Math, 
StaL, Vol. 14 (1943), pp, 333-362, 



4 


WILLIAM O. MAOOW AND LILLIAN H. MADOW 


from 1, , N without repetition have been selected from the t«^le of random 

numbers. The elements associated with these integers would be a random 
sample. It is easy to see that the two procedures are equivalent. 

A random sampling design that is not unrestricted is said to be restricted. 
There are many types of restricted random sampling designs of which what we 
call systematic designs are only one. Among these restricted designs are 
stratified, cluster, double, matched, pol}momial, and other sampling designs, 
each having been developed as attempts to bring theory and practice together, 
to suggest improvements in practice, and to solve problems arising in practice. 

By a systematic sampling design is meant a classification of the N elements 
into k classes, 8i, •••, St where Si consists of Xi , *,+*,, • • • , x<+(,_i)jb, and a 
random sampling procedure for selecting one of the Si . 

It is thus clear that a systematic sampling design is a type of cluster sampling 
design. It will be shown that the new aspect of cluster sampling introduced in 
systematic sampling is that a knowledge of the order of the elements in the 
population is used to obtain the values of the intraclass correlation coefficient 
and changes in the value of that coefficient as the size of sample changes. 

Sampling designs may involve combinations of random and systematic samp¬ 
ling designs, as well as random and nonrandom sampling procedures. 

The population from which these samples are drawn may or may not be strati¬ 
fied and the sampling units may be single elements or clusters of elements. 

4. Bases for selecting among sample designs. From the many sampling de¬ 
signs that can be constructed in order to obtain desired estimates, one will be 
chosen for use on the bases of administrative considerations, cost, and sampling 
error. It has become customary, on the basis of limiting distribution theory 
and the theory of best linear unbiased estimates to use the standard deviation 
of the sample estimate about the character estimated as the measure of tompling 
error. 

Although in tiiis paper we shall continue this practice, it must be pointed out 
that as more sampling designs are constructed, there is the danger that for some 
of these designs the limiting distribution theory is not valid, and the use of the 
standard error becomes more a matter of custom than the result of analysis. 
This danger is present for systematic sampUng designs and is being further in¬ 
vestigated. 

It is perhaps desirable to remark that bias, consistency, and efficiency are 
properties of the sampling design and estimation functions used, not of the 
particular sample obtained. Any estimate based on a sample will probably 
differ from the character estimated. It is the function of statistical analysis to 
indicate how large this difference may be. 

ft. Hotatkm. The letter, P, with appropriate subscripts is used for popula- 
%im, and sul^pulations such as strata. 



■■ SinBTSKAtlC^SAimjm'' . ^ 

The number oi strata is dmoted by L, and the nufiaber of elements in 
stratum is denoted by Ni. Sises of sample are denoted by n with 
subscripts. 

The arithmetic mean of the elements of a populatibn or subpopulation is de¬ 
noted by £ with appropriate subscripts. 

Any ptuticular subclass of a population as defined by the sampling design is 
denoted by iS with subscripts. Estimates based <hi an <S with subscripts aref 
denoted by f with subscripts. 

6. Unstratified systematic sampling, the samiding unit consisting of one de¬ 
ment. The values assumed by the subscripts used in this section are given in 
Appendix A. ' 

Let the population, P, consist of N elements xi, ••• , x/r. It is desired to 
estimate the arithmetic mean, x, of P. 

Let® N = kn, and let the class Si consist of the n elements , x<+* , • • • 

• Then, the systematic sampling dedgn for estimating i from a sample 
of size n, consists of the k classes. Si, Sk, and the requirement that &e 
sampling procedure be such that the probability is 1/k, that Si is the class se¬ 
lect^ by the sampling procedure. 

Let £i be the arithmetic mean of the elements of Si , i.e., n£i = x< 4- *.+* + 

• • • + a:. 4 (n-i)*, and let x be the sample mean, i.e., f = if Si is selected by 
the sampling procedure. 

In dealing with S 3 ^tematic sampling, we shall have occasion to use both the 
circular and non-circular definitions of the serial correlation coefficients and the 
associated serial variances. 

We shall assume that if ^ then Xk = Xh-kn . This is used in the circular 
definitions. 

Let kruA =* ~ 

^9 

and let knC/k, = — S)(x,+kk ~ 

9 

Then, the circular definition of the serial correlation coefficient pk^ is a*pkk ** 
Ck ^, which we shall use unless n is even, when W'e define Pkn/t by the equation 

2a*ptn/* *“ Ck»/t, 

in order to simplify the Writing of the formula for vt . 

Similarly, if we define the serial variance, «<„, by the equation knskk 

9 

(x, — x4«p)®, then we are using the circular definition of the serial varianoe. 
The circular definition of the serial variance ratio Vk„ is then <r\kn 
we shall use unless n is even, when we define Vkn/t by the equation 

ZffVn/* ■■ Sta/i. 
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The non-circular definitions of the serial correlations and serial variances are 

— x)(Xj+kl — x), 

) 

Cm, 

i 

/ 

a Vhi =* Sk% • 

The intraclass correlation coeflScient p* is defined by the equation 
cr% = ~ x){x^ — x), 

where the random process consists in first sampling one of the Si at random and 
then selecting two of the x’s at random from the Si that was selected. Then, 


since 




kal = ““ 

xf, 

and. 



(2) 

o^Pk = (n/n — l)<rj 

(1/n - ])(7* 

we have 



(3) 

~ (1 + (n - 
n 

• l)pt) 


given by 

( 1 ) 


and 


k{n b)C[, = 


2 f 
Pfca 


k{n - = 


It is easy to see from (1) that the intraclass correlation coefficient is given by 

“ ;r^i ? ’ 

and that consequently, if n is odd, p* is the arithmetic mean of the p^^ while if 
n is even, ph is equal to the arithmetic mean of the p*^ multiplied by n/{n — 1). 

Theorem^®: Using the systematic sampling design, the estimate x is an un- 
biased estimate of x, and has variance (r\ where 

<rj = /l-\ (w — 

{ ' n^ 6 

(4) “ 

= ^( 1+22 Pin) 

n ^ 

= ^ {1 + (»— 

n 


10 A proof of Theorem 1 that is somewhat simpler to follow but which, in the authors 
opinion, is not as informative as that given below could be obtained by substituting for 
pk using equations (2) and (3). The lemmas in Appendix B are, of course, of interest in 
themselves in finite sampling. 



f 


systtehatic sampling 


Proof: From the definitions of expected value, and the systematic samp** 
ling design, it follows that x is a variate with possible values f^ • , «* , the 
probability that at = x* being l/k. Then 


(5) k&z ** + • • • “t" > 

and, when the values of the 5, are substituted in (5), it follows that Sf « 
that is, ^ is an unbiased estimate of x. 

Having calculated fix, it is necessary to calculate in order to evaluate 
ffl . From the definition of expected values, it follows that 

( 6 ) k&x^ 1 4 “ • • • 4 “ > 

and when the values of the Xi are substituted in (6), it follows that 

(7) n^k&x^ — X) 

i* 0 i,y 


Then, when fiu) is replaced by in Lemma 6 of Appendix B, it follows, from 
the definition of the variance, that crl = £ (®/ — 

- a L(n ~ 6)^ , and when f(u) is replaced by w in Lemma 8, it 
vr 8 

follows that 

^ (s) ? <*■ - *>■+(&) I? -«■ 

■0)I 

If in Lemma 9 of Appendix B we now replace /(xy, xy+w) by (xy — Xy+w)* 
^ben (Tj “ <r ( “ j > 

W » 

and if we replace/(xy, xy+w) by (xy — x)(xy 4 .M — x) then 

<^* = ^ (<^* + 2 I 3 Ct^). 

n 


Finally, we have, then 


O’* 


and 


2 

O'* 


^ (1 + 2 2 fii*)’ 
n n 


7. Possible values of ttie pit, pkn and al . Let us investigate briefly the effects 
of different patterns of variation on the values of p'u and ai . Now <r*pu 

ji!(n - ~ g ) ^ ~ ~ Suppose that x< « x.+m , 5 == 1, • • • , n - 1, 
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t = 1, • • •, fc., Then 21 (*» — ^)* = n £ (*< — xf, and 2 (xj — £)(.Xj^ki *) 

V i j 

= (n — 8) 2 (Xi — ly. Upon substitution it follows that pis = 1, and <tI ^ a’. 

i 

This result for trl is intuitively clear, since all the variability is among the 
possible samples, and thus any particular systematic sample is equivalent to 
one observation. 

Suppose, on the other hand, that Xks+a - Xks+»a, |3 = 1, • • • , fc; 5 = 1, • • • , 
n — 1. Then ^ (a:, — i)* = fc 2^ (a:,+(o-i)i — if for any i, i = 1, • • • , k, and 

V a 

22 {Xj - ^)iXj+ks - *) = A: 22 (a;•+(^-l)* - x)(x.+(x+j-i)* - *). Furthermore 

i X 

0 = [53 (il'i+(o—1)A; ■“ ^)] ^ 53 (^t+Ca—l)Jfc “i“ 2 53 (Xx-^(\^i)k x)(x»4.(X45—1)* 

a a \,6 

— x). Hence 

222^^^pm=- 1 and al = Q. ' ‘V 

^ 5 U 

It is possible to constnict examples in which any particular pia = — 1, but in 
such cases the remaining pU each vanish. It is well known that the minimum 
value of p* is — l/(n -- 1). 

Finally, let us consider the expected values of pu and qI if the ;r’s have been 
assigned their subscripts at random. These values are ph = — l/(rifc — 1) 



In most practical applications of systematic sampling it will be highly unlikely 
that the distribution of the will be such that the x^s may be said to have been 
assigned their subscripts at random. In general, there will be logical reasons 
to expect that the x^s will have some fundamental trend. Thus, information 
will often be available, or may be obtained by a small subsample, on the basis 
of which a decision can be made to use some approach differing from that of 
assuming the subscripts of the z^s to have been assigned at random. 

8. Estimates of the parameters. The formulae obtained in section 6 for the 
variance of the mean of a systematic sample are population formulae. Their 
values depend on the values of all the elements of the population. However, 
even in tests of possible sampling procedures, we rarely have available the re¬ 
sources with which to study the entire population. Consequently, it becomes 
necessary to investigate the possibility of estimating the population variances 
and serial correlations from samples. It will be shown that the estimates of the 
variances and correlations derived from a single Si are biased and inconsistent 
whereas it will be possible to construct unbiased or consistent estimates from 
samples of more than one of the Si . The sampling variations of these estimates 
must be left for further study. 

Let us assume that instead of sampling only one of the Si , as we did in section 
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6 , we sampled 9 of the iSj at random. Then our sample would oonrist all the 
elements in the Sf. The sample mean, i, is defined by 

0 

if the subscripts of our sample classes are u , * ■ ■ , . 

Then it is easy to see that i is unbiased Furthermore, since we can regard 
this sampling procedure as the sampling of 9 of A; elements at randcwa, it follows 

that ffl ■= Y -V ~ we have evaluated vl in section 6 .'^ 

K — 1 g 

Since 

kal « 2 (*< — £)*, 

{ 

we shall consider estimating vs by where 

gsl » Z (^ “ i)*. 

0 

Now, since E £* = al + f*, and 


E = I L + «*) 

0 hi 

it follows that Ssj » al , and hence s| is an unbiased estimate of v|. Further- 

more &8a = —- - vj, 

k - g 

We now turn to estimates of the pi^ and v*. 

Let 




E — £)* 


0,a 


and let 


Q^kng 23 D* ^)(^4'(«+M—l)fc ^)» 

0,a 


Then, it may be shown that 




2 2 
<r — <ri , 


and 

Hence 


&Ckni ^ Cki0 ^ • 


It may be wondered why we sample these Si at random rather than systematically. 
If we sampled the Si systematically, it would be equivalent to taking a single systematic 
sample having smaller intervals between elements of the sample. Furthermore, we could 
not derive the unbiased estimates of the sampling variance that we can now. 
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The estimate, r*,,, of p*^ defined by 

[^" (^^)] 

is thus biased but in many cases the bias will be small Of course, if m = 

when n is even then rkn /2 is multiplied by 2 to estimate pkn /2 as previously defined. 
Another approach would be to consider ^ 

QTl to^g — 5 


/S.a 


and 


Qflvo^kng — ^ ^ (^/9+{.a—DA: ^/?) (^/9+(a+^—1)A: ^js)* 

/J.O 


When that is done, it follows that 

c -2 2 2 

G) = cr — (Tj , 

and 

^ tpCkiig ^ ^kfi ^2 j 

and 

HX + si) = (T^ 

&iwC]^g + sl) = Ckn . 

Another estimate of pit^ is thus defined by the equation 

wCkiig “i” ~ ti>rA;ii[ti7^^ “j“ S^]. 

When == 1, Sp = 0 and we are unable to provide unbiased estimates of 
Ckn , and <tI from the sample. However, since 

1 Tkn ^ ^g ^kng 


1 - rk„ 

it follows that approximately 

1 Pit/* <r 1 J^A:/* 

- SS Qy - J 

1 — Pit/I' 1 -- Tkft* 

since &[6l ~ . Similar equations hold for the wTkn . 
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When we estimate the pit , then the ‘‘within class'' definition is simpler. Let 
k{n — S)w<^ai OS 22 (*<+(x-i)* ~ iii<)(*< 4 (x+»-i)j! — M^<)i where 

t#X 

(n — ^iai» *= S iPt+(x-i)*i, 

(w — = 23 a^<+(x+»-i)*, 

X 


and let 

g(n — i)u>CA3i, == X) (^+(x-i))fc — i«^^)(iCjj+(x-M-i)* — 2ii%). 

AX 

Let 

kiCkig “ 2i!) ““ ^)> 


and let 

ff6Cib5<, = 23 - ^), 

fi 

Then 

/ t . t 

Cki *= wCjfea^ + bCibaa > 

and 

S(trCit8a + *= Cjfca , 

Thus, as estimates of the m we obtain rla where 

+ ifik^o — Tkiiv^o + Sa)« 

In cases where x is known simpler estimates of the pki ,, ph , and may be easily 
obtained since 

§23 (^j9+(a—1)A ^)(3!^3+(a-H»—1)* x) =* QTlCkn 7 

0.a 

§23 (a^^+(x-i)A: - x)fe+(x+a.i)A: - x) = sr(n-5)Cfca, 

AX 

and » 

§23 (^+(a-i)* — 

Aa 

Thus, in pilot studies, when £ is known it is possible to estimate the param¬ 
eters in al from even a single sample. 

9. Changes in the variance with changing size of sample. The chief reasons 
for expressing the variance of a systematic sampling design in terms of the 
variance of a random sample and the serial correlation coefficients were 

1. To enable the making of comparisons with random and other sampling 



12 


WILLIAM O. MADOW AND LILLIAN H« MADQW 


2. To simplify the analyids of causes for the difference in the efficiencies of the 
systematic and random designs, and 

3. To simplify the making of estimates of the variance for different siaes of 
sample. 

In this section we are concerned with the third of these reasons. We shall 
discuss only the p*^ since the analysis in terms of the pla is very similar. 

The problem with which we are concerned is the estimation of the function, 
Pk , of k. In order to show how this may be done for all values of k when the 
Pkft have been computed for one value of fc, let us first note that since <r* does 
not depend on k we may confine our considerations to the Ckn . In section 6 
we have defined by the equation 


Thus, if we wish to evaluate the Ckw where fc' is such that V k and i'n' « 
/pn « Nj we have the result CjkV' = C'*#* if = kp and, thus, for any given 
values k' and p\ we have 

Ck'n* = Ck fcv'/fc 


where we have replaced p by 


k • 


This procedure will involve, if k' < k, some interpolation, but if the p*^ are 
plotted against p, this interpolation may often be carried through graphically. 
However, it is usually advisable to take k so that the possible values of fc' are 
such that k' > k. 

In some cases it may be possible to construct a correlation function. For 
example, if the Xp may be represented by a polynomial in Vf/then pl« may be 
represented by a polynomial in 6, From that fact we conclude that if the Xp 
vary about a smooth trend the p*j will also vary about a smooth trend and it 
may be possible to interpolate. Further investigation of this problem is neces¬ 
sary. 


10. Stratified qrstematic sampling. In sampling practice it is customary to 
deal with stratified populations. The variance of an estimate based on a 
stratified population will usually not include the variability among the strata. 
Consequently, when a population is weU stratified the variability of estimates 
based in a sample of size n will usuaUy be considerably less than the variability 
of an estimate based on a random sample of size n, ignoring the strata. We 
now discuss the theory of systematic sampling from a stratified population. 

Let us assume that the population, P, consists of L strata, Pi, • - • , P^, 
the ath of which contains elements , • • • , XaNa • It is desired to estimate 
Ihe arithmetic mean, 2 of P. Let the arithmetic mean of Pa be denoted by 
• Let N a kf^a • 

We shall consider two possible cases, the first of which is often used because 
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of tbe adounistmtive simpUcity of givuy; idoitical operatwg instruotioQa to tbe 
people selectiiig samples in different places. The results of tlds section will 
indicate when this method may be used. 

Sampling Procedure I —Suppose that h = kt = • • • ki k, and that the 
sampling procedure consists in selecting one of the integers, 1, • • • , X; at random, 
each integer having a probability 1 /k of being selected. Then, if the inte^r 
selected is, for example, i, the sample of Pa consists of Xai , Xai+k, • ’ • , Xa<4(n.-iU. 
Thus, there are exactly k possible samples, St St, each having probalnlity 
1 /k of being the actual sample obtained by performing the sampling procedure. 

Sample Procedure II —The sampling procedure consists in selecting one of 
the integers 1, ■ ■ ■ , A;, at random, for each value of a, each int^er having a 

probability of 1 /ka of being selected. Then, there are exactly ki .Jbi 

possible samples, each having probability 1/A;i. ki of being the actual 

sample obtained by performing the sampling procedure. 

Other sampling procedures for stratified sampling, of course, exist. The two 
listed above, however, cover most practical problems except those involving 
cluster sampling. These will be treated in a later paper. Furthermore, frmn 
the conclusions derived concerning these procedures it will be possible to infer 
conclusions concerning other stratified sampling procediu%s. 

Let Sai be the class of elements Xoi , ®o<+*, • • • , Xo<+(n.-i)* . We consider 
sampling procedure I. A systematic sample of size ti« is to be selected from P#. 
The possible samples are Si, • • • , <8* where Si consists of all the elements in 
Su, ••• , Su . Let the arithmetic mean of the elements in Sai be denoted by 
Xai . Let the arithmetic mean of the sample from Pa be denoted by and let 
the sample mean be denoted by f, where 

Nx = Niii + • • • + JVtii. 

Then N&i = £ Na& 2 a = E 

a a tC i 

It follows from Appendix C, that 

« 

O'* = E NaN^Ci^t^ 

where 

0*^5 ^( 3 ta ” ^a*)iXh ■“ Xb) 

= r 2 (*•< - Xa)(ibi —Xi). 

K i 

Although the expression for 0’*.*^ can be further simplified, the important fact 
is that if corresponding items in different strata are positively correlated, it is 
inadvisable to use sampling procedure I unless other ccmsiderations than sampling 
error are dcnninant. But if the corresponding items are negatively correlated 
then sampling procedure I will yield a smaller variance than sampling proce¬ 
dure II. 
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We now consider sampling procedure II. The difference between sampling 
procedures I and II is that, in sampling procedure II we know that = 0, 
if a 3 ^ 6 because of the separate selection of sample in each stratum. Thus, 


under sampling procedure II, al 




2 N^irla where <y*, has been derived in 


section 6. 


11. A comparison of the efflciences of systematic and random Mmpling 
procedures. The study of any sampling technique is incomplete unless some 
comparisons are made with other possible sampling techniques. In this section 
the systematic sampling procedure is compared with the imrestricted random 
and stratified random sampling procedure. 

The means and variances associated with the random and stratified random 
sampling procedures will be denoted by the use of primes (') and double primes 
(") respectively. 

Then we know that 

and consequently (Tj < if 

2 < —in — l)/2(fcn — 1). 

If n is large relative to k, we may use — as an approximation to 
— (n — l)/2{kn — 1). 

In order to make more specific comparisons, it is useful to assume that the 
population elements Xv are given by some function of v, and to assume some func¬ 
tions jsuch as 

= Aq + Aiv + • • • Ah , 


or 


X, = Bo + ^1 sin + Bi cos 
* N 


2tv 

~N 


+ 


4- Ai, sin 


2Trhv 




and then to investigate the efficiencies of the various possible sampling proce¬ 
dures on the bases of such assumed distributions of the x„. It should be noted 
that the use of the systematic sampling technique involves the assumption that 
it is possible to order the elements of the population in a logical way, and then 
use this ordering in selecting the sample systematically. 

We shall now consider several possibilities. Let us first note that if we are 
sampling but one element from a stratum then the variance of the stratum 
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sample mean is the same whether the sampling is random or systematic. On 
the other hand, it follows from section 10 that if we stratify the peculation into 
L strata so that a systematic sample of size L chooses the jth element of each 
stratum, say, then the variance of the mean of the stratified random sample will 
be greater or less than the variance of the mean of the systematic smnple de* 
pending on whether the average correlation between strata sample means in the 
systematic sample is negative or positive. ’ 

Let us now consider the origin of the warnings against the use of systematic 
samples from a population having a periodic distribution. If k is the period, 
the correlation between the strata means of the systematic sample is +1 and 
hence the random sample is superior. However, if the period is 2k then we 
shall show that the systematic sample will probably have a smaller variance. 

Suppose that the period is 2k and that within two adjoining strata of size k 
we always have Xi = = xat-i, • • • , x* = and x< — x = — (xjb+< — x). 

Then, if we are sampling one element from each stratum, the correlation between 
the systematic sample means, (the individual elements in this case), will be -^1 
if the strata subscripts differ by an odd number and +1 if the strata subscripts 
differ by an even number. 

The variance within each of the n strata is <t\ , where 

ka\ (Xi — x)^ 


The variance between strata means is zero. Hence = ori . The variance 
of the mean of an unrestricted random sample of size n where n = L is then 

~ and the variance of the stratified random sampling mean is 
AT — 1 L 

err' == (1/L)<ri while the variance of the systematic sampling mean is 


2 


2 L 


l)-'(2 - 5.-,) 


where 6, j = 1 if i = j and 5,y = 0 if i j. 

Then it may be shovm that if L is even 0 -^ = 0 while, if L is odd = (1/L*)^?. 
Consequently, the efficiency of the systematic sample mean is greater than 
the efficiency of the stratified random sample mean if the population has a 
periodic distribution and the size of stratum is half the period. It should be 
noted that the same situation holds for k equal to an even or odd multiple of half 
the period as held for k equal to the period of half the period. 

The situation is quite different if we assume that the elements of the popula¬ 
tion have a straight line distribution. Without loss of generality, we may 
assume that the straight line distribution is given by x„ = v. Then for an 
unrestricted random sample of size n, the sample mean being denoted by 
we have Sx' = x = \{kn -f 1), 
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and 


i 

at' 


(k — l)(kn + 1) 
12 


For a stratified random sampling design, let us assume that ATi 


Nr 


n 


where c may equal any of the integers 1, 2, 
c* A; — 1 


n 


, n: i.e. L = - . Let 
c 


m *> • • • * nj, = c. Then al" =■ -j -r —r £ «^a where i" is the sample mean of 

7J,* CW 1 a 

the stratified random sample. If the oth stratum contains , • • • , 


then O'* 


cV - 1 
12 


and 


at’ 


c fc — 1 c A' 




n ck — I 12 


Finally 


(x, - X,'+m)* 


j fc*(Tl* “ 1) 

= (T 


12 


&* - 1 
12 


To summarize 


ai' 


(k - l)(Jfcn +!)_(*- l)(N + 1) 

12 


12 


2 c(k - l)ick+ 1) 

<ft" = — 




12n 


12L 


at = 


- 1 
12 


It is clear that both al and al" are less than al-. However al/al’ > = -jt • 

N i t 

Since kn — N and cL — nit follows that fc = and hence at < at" if N > 

cL 


L{L — 1 ) and c > 


1 


LjL - 1 ) • 
N 


In all cases N > c. It follows therefore 


that for a value of c to exist we must have ^ > rr-- ~t as a result of 

L N — L{L — 1) 

which we find that N must exceed 2L* — L. Hence al < al" if N > 2L* — L 
and c > -. Otherwise al > at 


1 - 


L(L - 1) • 

, AT 
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lliis result follows from two facts: 

(1) If ooe element is being taken from each stratum then the h^ average 
correlation between strata means results in the efficiency of the stratified randmn 
sampling mean being greater than the efficiency dt the systematic sampling memx, 
despite the equal within stratum valances. 

(2) If more than one element is being taken from each stratum then the within 
stratum variance of the systematic sampling mean is less than the within stratum 
variance of the stratified random sampling mean and if the size of stratum and 
sample from stratum are large enough, the smaller within stratum variance of 
the syst^iUatic sample more than compensates for the correlation among strata 
means. 

Of course, in a straight line distribution there are much more efficient methods 
of defining a stratified random sample than that we have used. Furthermore, 
more efficient sampling procedures than those discussed are available. How¬ 
ever, this example will be of use in indicating the general problems that arise 
as well as the procedures that may be followed in attacking them. .. 

Another comparison of systematic and stratified random sampling may be 
obtained by considering the x, to be composed of ^o elements, a trend function 
and a periodic function so that the deviations from the trend constitute a 
periodic function. 

Let 't, — ^(y) -f- ^(k), where ^i(») is a trend function and vt(») is a periodic 
function of period 2/i, AT = 2hQ. 

Let vi(») = • Then yj = = • • • = y»(o-i)+y ,3 - 1, • • • , 2fe and 

!/»«+> - 5 = -(y»-+»+y - i » 1, • • • , h, o - 0, • • • , Q - 1. 

Since the sizes of sample that we shall consider for purposes of this comparison 
are all multiples of h, we shall calculate our variances and covariances so that we 
obtain all the necessary information at once. 

Let the mean of ^(i') be denoted by ^ and let the mean of ^(v) be denoted 
by y. Then i ^ -h ^ and 

= £ — ?i]* + Z) (y' ~ 5)* + 2 23 *" S) 

9 9 r 

= 23 (^{o-DA+i — ^la)* + A £ (^la — 
a,i a 

+ 23 “ ^«)* + A 23 (^o “ yf 

a,i a 

+ £ [^l(a-l);i+i — ^la](y(a^l)h^ ““ Ja) + ]C (?la ^>l)(5a — ^?) 
a.t a 

where o * !,•••, 2Q; i — 1, • • • , A, is the arithmetic mean of vi[(o-i)*+i], 
- * ■ , ^Kok] and is the arithmetic mean of y(a-i)k 4 i 

It follows from the assumptions with respect to the y, that ^ § and tiiat 

h 

21 (y« ~ 9“)^ js same for each value of a and is equal to 23 (y< ~ ?)*• 
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Since yi ~ yvi^ = • • • = 2 / 2 A(q-i)+* and = * 

we have 

WiM ^l](2/» 5) = 2 (y* "" 5) ^ [^U2a-2)H» “ ^12o-.2] 

I' ^ ♦■■I c*»l 


y»(0-l)+A4< 


.i 

Since Vi - y '= (?/<+» 


.4 « 

+ ]C (y> ~ V)^ (¥>l((Jo-2)A+<) — ?>lS«-s]. 
i-A-Hl a-1 

), we also have 53 — ^](2/» - V) - 


lliVi - y) k((2o - 2)h + i) - <pi((2a - l)h + i) - ^i 2 „_s + <pYta-i] 


o-«i M 


which vanishes for example if <p\{v) is a straight line or if ^i(v) is a succession of 
straight fines each having length 2/i. 


Let us now assume that ipiiv) = 4 + Bv. Then — A + B 


h + 




n 


* . jt +1 

<^i((a — l)h + i) — (pia = i — —-—, 


(a - 


l)h + 


2 B%{h^ ~ 1) 


^ [v?i((a — l)ft + i) — ^af = 

t 

£ . „)■. 

a i« 




12 


ifii = A + B 


2h + 1 


B“ 

Then — ol + [4/i^Q^ — 1] where hal = ^ (t/, — and the variance of 

12 i 

N 71 (T^ 

the mean of an unrestricted random sample of size n is (^^ = -^ - . 

iv — 1 n 

Let us assume now that the size of stratum is mh where m is a factor of 2Q, 
say 2Q/m = Lm . Then the variance within each of the Lm strata is a constant, 
B^ 

say, ffl where <ri = — 1] if m is even. If m is odd then Lm is even 

12 

1 ^ 

and in half the strata the within stratum variance is <rj + — ^ (yi — 

nm *»i 


9) 


2 

<ri 


while in the other strata, the within stratum 


vanance is 



SYSTEMATIC SAMPLING 




Then, if c elements are sampled at random from ^ch of the L« strata, it follows 
that 



In order to evaluate the variance of the systematic sampling mean let us 
evaluate = A;(n — B)8k6 . Now upon substituting for Xp , it 

follows that kin - = X (Vj ~ X (Vj - + 

A:(n - 6)BW. 

Then, if fc is a multiple of /i, it follows that ^ (yj — t/y+w) = 0. Furthermore, 
if fc is an even multiple of h, then yj = yy+a* and hence ^ ( 2 /y — yj+ik) = 0. 

7 

Finally, if A: is an odd multiple of h then, if 5 is an odd number y, — yy+j* = 
2(j/y — y) while, if 5 is even y, — yy 4 j* = 0 and hence 

Z (.Vi ~ yi+*kf = 4 Z (Vi - vf 

i 3 


if k is an odd multiple of h and d is an odd number. Note that if k is an odd 
multiple of h, then n is an even number. Since 


2 

(Ti 


= «r* - 23 (w - 


n- I 


it is necessary to evaluate ^ (n — 3)8*j. Now, if A: is an even multiple of h, 

t 

it follows that (n — d)sti = (n — and 

23 (n - «)«M = 23 «* - Z «’} 


6 

-. 2/2 


* 12 

Hence, if A: is an even multiple of h, it follows that <rl = — 


BV(n* - 1 ) 
12 


On the other hand, if A; is an odd multiple of A, and if d is odd, 
we have {n — d)sk6 = (n — d)B^k^B^ + 4(n — d)al while if 5 is even (n — 5)slba = 
(n - 5)fiW. 


Hence 


2^ (n - = -;ro-^ + n dy . 


12 
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Hence, if A; is an odd multiple of h, it follows that 

2 2 S‘k*(n^ — 1 ) 2 

--- 

Then, if A; is an even multiple of h 

(rl= <rl + ^ (k^n^ - 1) - ^ **(»* - 0 


12 


12 


ffy + ^ (*!* ~ 1 )> 


and, if A; is an odd multiple of A, then <rj = — ■“ !)• 

Thus, systematic sampling will yield superior results if 

L 


c > 


1 + 


l2ol 


B^(hm)(hm — 1) 


L{L - 1) 
N 


N 


Since -^ > c it follows that for a solution, c, to exist, we must have 
Lj 




12. Summaxy. In this paper we have presented the theoretical basis for 
systematic sampling for stratified and unstratified populations including the 
derivation of the variances, a study of the possible values of the parameters, 
estimates of the parameters, the effects of changing the size of sample, and 
cotnparisons among systematic sampling, unrestricted random sampling, and 
stratified random sampling. The paper contains for the case where the sampling 
unit consists of one element, not only the theory necessary, but in addition, 
some analysis of the conditions under which systematic sampling ought be used, 
and formulas for calculating the variances. 

In later papers of this series, we shall present the theory of systematic sampling 
when the sampling unit is a cluster of elements, the theory when we assume we 
are sampling not from a finite population but an infinite population, each of whose 
elements is normally distributed, and further studies of various parts of the 
theory and practice of systematic sampling. 

Appendix A 

Values Assumed by Certain Variables 

In order to avoid repeating the limits of summation of variables, we shall give 
these limits in this appendix. 
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TABLE I 

Values Assumed by Svbscrvpts 


Letter 

The letter will assume all integral values from 1 to 

i 

k 

X 

n — 6 

j 

fc(n — d) 

6 

n — 1 

Pf V* 

kn 

a 

n 

7 

n 


n 1 

n/2 if n is even, —if n is odd 

A 

a, b 

L 


The letter will assume the values ii, is, • • • ,ig where ii, »• • , ig are a selection 
of g of the k integers 1, • • • , A. 

Appendix B 

On the Limits of Some Finite Sums 

The difficulties that arise in the transformation of finite sums are very similar 
to those that arise in the theory of transforming multiple integrals, i.e., the effects 
of transforming variables or order of summation on the limits of summation. 
Certain lemmas that have proved useful in this paper are presented separately 
here in a more general form. 

Let/(w) and f(u, v) be functions of u and v that are finite for all possible valuer 
of u and v. 

Lemma 1 . 

f{^i+(a—t)k > ^>(7—DAr) ~ 221)*) 

0,7 itk 

o< 7 

Proof; I^t a = X and let 7 = X + 5. Since 1 < a < 7 and 7 < n, the possible 
values of 6 are 1, • • • , (n ~ 1). For any fixed value of 6 the possible values of 
X then are 1 to n — 6 since X = 7 — 5 and, for a fixed value of b the maximum 
value of X is determined when y ^ n. With these limits each term of / on the 
left side <rf the equation occurs once and only once on the right side of the equa¬ 
tion. Furthermore, no additional term occurs on the right side of the equation. 
Lemma 2. 

22 22DAr) ~ 22/fe’ 7 

i X i 

Proof; Ijet j «= t + (X — 1)A;. Then j is a monotone increasing function of i 
and X. The minimum value of j occurs when f = 1, In that case i == 1. The 
maximum value of j occurs when i = A;, X = w — 5. In that case j = (n — 6)k, 
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With these limits each term of f on the left side of the equation occurs once and 
only once on the right side of the equation. Furthermore, no additional term 
occurs on the light side of the equation. 

Lemma 3. 

i,ot,y 6,i 

Proof: First apply Lemma 1 to X) , Xi^(y^i)k) and then apply Lemma 

a,y 

a<y 

2 to the resulting expression. 

Lemma 4. 

H Ifixj) + fixj+ki)] = (n - 1) z fiXy). 

i.j V 

Proof: Let m = j + kb. Then for any fixed value of b the minimum value 
of m occurs when j = 1. In that case m = + L For any fixed value of 6, 

the maximum value of m occurs when j = k{n — 5). In that case m = nk. 
The letter m will assume all integral values from + 1 to fcn, and hence, 

Z/fe) + Z/(^/+m) = Z/fe) + E/(Xm). 

6,i i,j d,j S.m 

If we sum b from n — 1 to 1 instead of from 1 to n — 1 in ^f(xm) we see that 

S,m 

HfiXi) + Z/(a:m) = Z /fe) + Z /(a^n) 

S,j S,m j—I m—/;( n—1)4*1 

+ ••• 

+ HfiXj) + Z fiXm) 

y—1 f?»-»A:41 

where the summations of xj are terms of X) fi^i) the summations of x^ are 

'k(n-~i) kn 

terms of Z/(®m). But Z /(a^>) + Z fixm) = Z /(a:») and hence 

<.m y-1 m-«:(n-«) + l » 

Lemma 4 is proved. 

Lemma 5. Let 

/(®i+(o—i)t)/(a^«+(T—1)*) ~ -d • 

*.a,7 

Tlutn 

^ = n Z [fMf - Z [/fe) - 

^ “ 2!) [/(^»4-f«—l)Ai)] "t" 2 /(^t+(7—!)*:)• 

t,a t,a,7 

«<7 


Proof: 



SYSTSUATIC SAMPUNQ 


23 


By Lemma 3 

2 2 f{Xi+(a-l)k) = 2 23/(®j')/(Xj+lt) 

i*a,y b,i 

«<7 

and since we have 

2/(*,)/(xy+,t) = fiXif + fix^kf - l/(x/) - /(Xy4tt)]*, 

the proof is completed by using Lemma 4. 

Lemma 6. Let knf = £/(x,). Then 

V 

(i^) ? - /] - 2IM) - /fe+w)]. 

Proof: This lemma is a direct consequence of Lemma 5 . 

Lemma 7. 

^ = Z [/(x,) - /]* + 2E [/(Xy) - /][/(x,+«) - /] + 

Proof: From Iwemma 4, it follows that 

A = rZ/(x.)“ - E ([/(Xy) - + f/(Xy+M) - rfi+ 2E [/(Xy) - /]f/(xy4«) - /] 

r j,i \ 

and hence, from Lemma 3, it follows that 
A = nE[/(xv) - /]“ + nV - (n - l)i: (/(x,) - /f 

+ 2E[/(Xy) - /]I/(xy+*j) - /], 

whence the lemma is proved. 

Lemma 8. 

Ki) - " (i) ? '■'<"’> - •'*’+te>) 5 

This lemma is a direct consequence of Lemma 7. 

Lemma 9. Ifh> kn, let xh equal xn-kn • /(«, v) = /(v, m) t.c, / is symmetric. 

Then, if we let 

dki = ]C/(xy, Xy+te) 

i 

it follows that 

dkh + d*fi-a = a?r4w). 

9 

Proof: Obviously 

E /(x» > x,+«) = dti + B, 
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where 

kn 

B = 2!) f(^Q 9 

Now, let A = ^ — (n — S)k. Then 

ik 

B “ fiXh^(n—6)k ) ^A+fcn)* 

Since a:A+*» = x*, and/(**+.(„_«*, Zh) = f(xk , xa+(„-«*), it follows that B = dkn-i 
and the lemma is proved. It is noted that the ssmimetry of /(«, v) is necessary 
as well as sufficient, for if /(x,, x, 4 Ai) = x, — x,+a< the theorem is false. 

Appendix C 

Slratijied Sampling 

Let the population P consist of L strata Pi, • ■ ■ , Pl ■ Let x be the arithmetic 
mean of P, and £a the arithmetic mean of Pa . Let fa be the sample estimate of 
f«, and let i * 2 c«fa. Then = ]^ CaAa — A where — Aa . Let 

a a 

ffi be defined by at = €(f — x)*. Then <rj == &(x — Af + (A — xf and hence 
it is easy to see that cl = ^ CaCiCi,ii, + (A — f )* where 

a.6 

— A,)(fj — Aa), 

(A - 2f = 1^2 {^»Aa - ~ x,^j, 

and if NCa = Na , then 

(A - X)* = ]C CoCA(Aa - X„)(Aa - Xa) 

a,b 

and (Tf = 2 ^aCbtr^aib 

a,b 

where 

c\,l^ = §(fa - Xo)(xa - ib). 

These formulae hold whatever may be the method used in sampling the fth 
stratum. If f is an unbiased estimate of f and Xa is independent of f a , then the 
usual formula <r| = X) holds. The formula for will, of course, depend 

a 

on whether a random, cluster, systematic, or other sampling procedure is used. 



ON THE PROBABIUTT THEORY OF LINKAGE IN MENDELIAN 

HEREDITY 

By Hilda Geihingeb 
Bryn Mawr Colleger 

1. Introduction. If for a certain generation the distribution of genotypes is 
known and a certain law of heredity is assumed, the distribution of genotypes in 
the next generation can be computed. Suppose there are N different genotypes 

s 

in the nth generation in the proportions • • • , where S — 1 and 

t»i 

denote by pix the probability that an offspring of two parents of types k and X 

be of type i where 21 = 1 for all k and X, and p\\ == pL . Assuming pan- 

mixia, identical distributions for males and females, etc., we can derive 
from x<”^ by means of the formula 

(1) = i: (i = 1,2, ■•• N). 

it.X-1 

Thus if the distribution xf^ is given for an initial generation we can deduce suc¬ 
cessively the Xi^\ xi^\ • • • for subsequent generations. Ifesides, one may wish 
to express the xl”\ for any n, explicitly in terms of the initial distribution xf^, 
i.e. to “solve’^ the system (1). A further problem consists in determining the 
limit-distribution of the genotypes lim xj”^ (i = 1, • • • , N). 

n“*oo 

Menders heredity theory is based on some ingenious assumptions which are 
known as Mendel’s first and second law. They enable us to define the possible 
genotypes and to establish the recurrence formula (1); they will be explained and 
formulated in sections 2 and 3. It is well known that in Mendel’s theory it makes 
an essential difference whether one or more ‘‘Mendelian characters” are con¬ 
sidered. In the first case Mendel’s first law only is used; there are with respect 
to this characjter but N = 3 different types and the recurrence formula (1) can 
be derived without difficulty. As early as 1908 G. H. Hardy [5] established the 
simple but most remarkable result that under random breeding a state of equilib¬ 
rium is reached in the first filial generation, i.e. Xi^^ ^ xj"^ (in general) but xl”^ = 
(n = 2, 3, • • • ,)/ 

In the case of m ^ 2 Mendelian characters Mendel assumed independent 
assortment of these characters (Mendel’s second law). However, within four 
years after the dramatic rediscovery of Mendel’s fundamental paper [10], observa¬ 
tions were reported that did not show the results expected for two independent 
characters. T. H. Morgan [11] and collaborators in basic contributions, con- 

^ See also [12] where the stability of the particular ratio 1:2:1 is recognized. 
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eluded that a certain linkage of genes was to be assumed.^ Taking that as the 
starting point, the main purpose of this paper is to establish the basic recur¬ 
rence formula for the general case of linkage, to solve the corresponding system 
of difiference equations, and to determine the limit distribution of genotypes. 
Throughout the paper “multiple alleles’’ are considered instead of making the 
frequent restriction to two alleles. This generalization is, however, an obvious 
one (see section 1). 

In order to deal mth the general problem a linkage distribution (l.d.), is in¬ 
troduced. This concept, which seems to be basic to the whole problem, refers 
to the probability theorj’^ of arbitrarily linked events [3]. The crossover pro6- 
abilities^ (c p ), defined b}'' Morgan and Haldane are, notwithstanding their high 
importance, not sufficient for our purpose. (They turn out to be certain mar¬ 
ginal distributions of the l.d.) If, however, m = 2 and iV = 10 (for two alleles), 
a case studied by W. Weinberg [16] H. S. Jennings [7] and R. B. Robbins [14], 
the c.p. is equivalent to the l.d. But for the general case the l.d. is needed and 
the desired results must l>e derived by other methods than explicit computation, 
which is feasible if m equals one or two. The original problem of independent 
assortment appears, of course, as a particular case of the general linkage. This 
problem was completely solved in 1923 by H. Tietze [15] in a very interesting 
but rather involved paper. The proof of the limit theorem given in the follow¬ 
ing pages for the general case is far simpler and shorter than the treatment of the 
particular case in the older paper and is therefore a simpler proof of Tietze’s 
theorem. 

After a brief consideration of the classical case w = 1 (section 2), the problem 
of m arbitrarily linked characters is discussed in section 3 with a particular view 
to a clear statement of the biological and mathematical assumptions. The l.d., 
its relation to the c.p., and some basic properties of both are considered in sec¬ 
tion 4. Then, after a very concise consideration of the case m = 2 (section 5), 
the basic recurrence formula is established in section 6 from which we deduce in 
section 7 two general limit theorems. The main point is that the limit dis¬ 
tribution of genotypes is “uncorrelated” and equals the product of certain 
marginal distributions of first order deduced from the distribution for the first 
filial generation. As a kind of an appendix section 8 contains the solution of 
the system of equations furnished by the general recurrence formula. 

In the second part of the paper an attempt is made to contribute to the linear 
theory or theory of the linear order of the genes, from the point of view of prob¬ 
ability theory. Accordingly, the linear theory consists in certain assumptions 
on the l.d., or on an equivalent distribution which will be called crossover dis¬ 
tribution^ (c.d.), and which is more appropriate for this purpose. (Sections 9 

* T. H. Morgan and his associates and students is due the credit for opening up this 
new field of genetic research; and the small vinegar fly Drosophyla Melanogaster upon which 
most of their work has been based, has now assumed as great an importance in genetics as 
the famous peas studied by Mendel.” (Sinnot and Dunn, Principles of Oenetics^ New York 
1939 .) 
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and 10.) In this connection in section 10 a probability definition of the 
tance’^ dij of two genes is proposed which, far from being contradictory to Mor* 
gan’s ideas on the subject seems to formulate them mathematically; (the di^ 
tance between two genes i and j is defined as the mathematical expectation 
of the number of crossovers between i and j). This distance is of course additive 
as it ought to be in the framework of the linear theory. , 

A problem frequently discussed is whether the crossover probabilities are 
independent of each other (this independence is not identical with MendePs 
free assortment). Observations (see [4a]) did not seem to substantiate this as 
a general assumption. Then it was concluded that there exists a so-called 
interference which prevents, i.e. diminishes the probability of crossovers too 
“near” to each other. (See also [13].) It seems to the author that observations 
on interference should be interpreted in terms of appropriate assumjptions re¬ 
garding the l.d. or the c.d. Again the remark holds that the c.p.\s are not suffi¬ 
cient for describing the situation. Hence in section 11 an attempt is made to 
understand “interference” by means of the c.d., accepting however the linear 
theory. It is well known that the explicit presentation of consistent dependent 
distributions is not trivial (see e.g. [2]). Not many different types of “conta¬ 
gious” distributions are known. In section 11 two such schemes are propoi^ 
which, though simple enough, seem to correspond to the general idea of, inter¬ 
ference. They contain as particular cases the case of independent and the case 
of disjoint crossovers. 

2. One Mendelian character. Hardy’s theorem. It will be helpful to start 
with the simple and well known case of one character introducing the basic con¬ 
cepts in a way appropriate for generalization. 

Mendel recognized that the distribution of certain hereditary attributes in 
organisms is similar to the distribution of attributes in a probability distribution. 
With respect to a Mendelian character each individual is characterized by tu>o 
elements called genes which represent two possible alternatives. The color of 
the flower of peas is such a character, the alternatives being red and white. 
With respect to this character each plant belongs to one of the three types: 
red-red, red-white, white-white,^ These are three different genotypes ,—In this 
paper genotypes only will be considered. The difference between genotypes and 
phenotypes and the related concepts of dominant and recessive qualities will 
not be dealt with. This is an example of a two-valued Mendelian character, i.e. 
a character for which only two possibilities exist or, using a more technical term, 

* It will be assumed throughout that the individuals considered are “diploid*’. That 
means in the terminology of the preceding example that the only possible types are RR, 
RW, and WW; or, using A and a: AA, Aa, and Modern research has however revealed 
that situations may arise where “tetraploids*’, “hexaploids”, etc. briefly “polyploids” 
prevail, i.e. types like (with x-h y ^ 2p). In this case the reproduction cell segregates 
(with Zi -f « p). Stability is no longer reached in the first filial generation. See 

[4b]. 
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with two alleles. The case of two alleles is most frequently considered in the 
biological literature where the two possibilities correspond mostly to a dominant 
and a recessive quality. There is, however, no difficulty in considering from the 
very beginning the general case of multiple alleles where the character under 
consideration is assumed to be r-valued, i.e. susceptible to r different manifesta¬ 
tions (e.g. r = 5 poasible colors of a plant). These r possible values may be 
distinguished by the r arguments, 1, 2, • • • r/ 

In the consideration of only one Mendelian character MendeVs first law only 
is used which may he stated as follows: 

(a) . With respect to one r-valued Mendelian character each individual 
belongs to one of the r(r + l)/2 possible types, each type being determined 

by a pair of elements (genes) x and y 

(b) . In the formation of a new individual each parent transmits one of its 
two genes to the new individual, the other gene coming from the other parent. 

(c) . The probability for the transmission of either gene is the same and 
thus equals 

We wish to deduce the distribution of genotypes in the (n + l)st generation 
from the distribution of genotypes in the nth generation under the assumption of 
complete panmixia (random breeding). Moreover, assume that the given initial 
distributions of genotypes as well as the laws of heredity are the same for males 
and females!* In computing successively the new distribution from the pre¬ 
ceding one we shall always assume that the distribution of individuals partici¬ 
pating in the process of procreation is the same as their distribution when born. 

Let us denote a genotype by (x; ?/), (x = 1, • • • , r; 2 / = 1, • • • , r). To fix 
the ideas we shall assume through this paper that the gene x before the semi¬ 
colon was transmitted by the mother, and the y after the semicolon by the father 
of the individual. In some cases which will be considered later this distinction 
will be relevant. Denote by (x; y) the probability of the type (x; y) in the 
nth generation. Since the laws of heredity are the same for males and females 
we have w^^^ (x;y) = (y; x) and thus have for each generation a symmetric 

distribution of genotypes with r*” probabilities whose sum is one. There is, 
however, according to principle (a) no difference between the types (x; y) and 
(y; x) and therefore it is preferable to group together these types, thus intro¬ 
ducing for X = 1, • • • , r; y = 1, • • • , r: 

*) = x) 

(2) 

y) = y) + x) where x < y. 

is simplest to deal with mere pairs of alternative conditions (alleles) but a theory 
remains seriously inadequate unless capable of extension to multiple alleles. ” ([17] p. 224). 

* It is obvious that we may admit without any change of result different distributions for 
males and females in the initial generation, as long as random mating takes place after¬ 
wards. 
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Consequently there are r(r + l)/2 such probabilities: 


(20 «'"'(!; 1), 2), • • • r), »‘’‘'(2; 2), • • • v^”\2; r), 








where 


(3) 


Z y) = 1 (n = 0,1, 2, • • •)• 

x^V 


Now define p^”\x) as the probability that in the nth generation a male (or a 
female) individual tranamita the gene x. Obviously we have: 


(4) 

and 


p<">(x) = j X) + i t;'“>(2; x) + • • • + x) 

+ i X + 1) + • • • + i ^‘’‘’(x; r) 


(40 




In fact, the gene x will be transmitted, if an individual possesses this gene and 
also transmits it. The individuals of type (y; x) (or (x; y)) all possess the gene 
X and transmit it with probability i if y x and with probability 1 if y = x. 
Besides, the probability of the type (x; y) in the (n + l)st generation is ob¬ 
viously p'"'(x)p^'‘’(y): 

(5) ta<"+‘>(x; y) = p"‘'(x)p'”>(y) = x) 

or in terms of the i>'"’(x; y) 

^ :;"*+"(x;x) = [p<"'(x)]* 

t)‘’*‘^“(x; y) = 2p'"’(x)p'"’(y) (x ^ y). 

Hence by (4) and (50, t'*"'^** has been expressed in terms of u'"' and the recur¬ 
rence-problem is solved. The distribution ti>‘"''’‘^(x; y)(n ^ 0) shows “inde¬ 
pendence,” and is therefore known to be stable. In fact, computing in the same 
way p‘"'''“(x) we get 

p<"+«(x) = ^2p<’‘’(l)p‘"'(x) -[-••• p^’‘\x)p^”\x) 

+ i-2p‘">(x)p'”>(x + !)+•••+ i-2p<'‘>(x)p'"’(r) 

= p'"’(x)-i:p^"’(p) = p'-'cx) 

p-i 

or 

p‘"+”(x) = p'"’(x) 

(n = 0,l,2, •••),(* = 1,2, •••r). 

This last fomula contains G. H. Hardy’s famous result [5] thcd p^’‘’(x) ia the acme 
for aU n: 


(7) 


p‘"’(*) = P‘”(*) 


(n= 1,2, ...,) 
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and because of (5'): 

(70 y) - v^"(,x; y) (x ^ y, n = 2, 3, •• • ). 

In case of only one Mendelian property the distribution of genotypes reaches a 
stationary state in the first filial generation. 


3. Basic assumptions in case of m Mendelian characters. A new situation 
presents itself if there is more than one character. In case of m characters a 
genotype is described by 2m numbers (xi, • • * , ; 2/i > * * * » 2/m) or briefly {x) y) 

(e.g. for m = 5, r = 9:(1,2,3,4,6; 2,7,3,5,9). There are primarily N = 
possible types because on each of the 2m places any of the r numbers can be 
written. Now, if the types {x] y) and (i/; x) are considered as identical genotypes, 

r^ 

the number of different genotypes reduces to N\ = + 1) (e.g. for r = 2, 

A 


m = l:Ai = 3;forr = m = 2: A'l = 10). It is essential for the understanding 
of linkage that in counting this way two types like (1,3; 5,7) and (1,7; 5,3) or 
(1,1;2,2) and (1,2;2,1) are considered as diffei’ent although in both cases the 
individual possesses with respect to the first character the gene pair 1,5 and with 
respect to the second the pair 3,7. If no difference is assumed between two such 

fr(r + 1)^ 


types the number of different genotypes reduces to N 2 


- 


(E.g. for 


r = 2:N = 4"*, Ni = ^•2"(2’” + 1), N 2 = 3"; hence for m r == 2:A' = 16, 
N, = 10, N 2 = 9 or for r = 2, m = 3:N = 64, Ni = 36, N 2 = 27). Which 
method of counting is the correct one? 

The answer is that there are but N 2 different genotypes if MendeVs second law, 
the law of independent assortment, is accepted. Then and only then there is no 
difference between types like (1,3:5,7) and (1,7:5,3). Under the assumption 
of general linkage however, these types must be distinguished, not as individuals, 
but with respect to their heredity properties, i.e. considered as parents of a new 
generation. Under this assumption there are in general N\ different types. 
This will be discussed presently in more detail. 

Let us first consider Menders original theory as contained in his first and 
second law. Analogous to (a), (b) and (c) in §2 we now formulate as follows: 
(a') With respect to m characters the genotype of an individual is char¬ 
acterized by m pairs of numbers. Two individuals are of the same type if to 
each of the m characters corresponds the same pair. Hence there are N 2 == 
^r(r + 1)'^ 




genotypes. 


(b') In the formation of a new individual a parent of type (xi, • • • , x^l 
2/1 > * * * » 2 /m) transmits to the offspring, corresponding to each of the m char¬ 
acters, one of the two genes which he (or she) possesses with respect to this 
character. 

(c') The probability of transmitting any of these 2”* combinations is the 
same and therefore equal to 1/2’”. 
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Consider e.g. the individual (1,2,3:1,4,7); the pair 1,1 corresponds to the 
first character the pair 2,4 to th^ second and 3,7 to the third. Under the 
assumptions of MendeFs original theory this individual is of the same type with 
(1,4,3; 1,2,7) and (1,2,7; 1,4,3), and of course with (1,4,7; 1,2,3), etc. 
As m = 3, it may transmit eight combinations which in the preceding example 
reduce to four, because the individual is homozygous in the first character. 
These four combinations are 1,2,3 or 1,4,3 or 1,2,7 or 1,4,7 each with prob¬ 
ability 2 X i = J. 

The distribution of genotypes in successive generations under the assumption 
of MendePs second law has been investigated by H. Tietze [15] who also con¬ 
siders the limiting distribution as n oo . His results will appear as a particular 
case of our general considerations. 

In order to discuss the basic facts which lead to the idea of linkage let us for 
the moment consider the case m = 2. Soon after the rediscovery of MehdePs 
work Bateson and Punett reported observations which did not give the expected 
numerical results. To understand the type of such an observation assume that 
a homozygous male of type (1,1; 1,1), [or any other homozygous type, e.g. 
(2,3; 2,3)] is mated to a homozygous female of type (2,2; 2,2) [or to any homo¬ 
zygous type different from the first e.g. (4,6; 4,5)]. Obviously, in this case there 
is only one possible kind of offspring namely (2,2; 1,1), [or (4,5; 2,3)]. But 
if now one of these daughters is mated to a homozygous male of the original 
type (1,1; 1,1), there are four kinds of possible offspring, namel}" (2,2; 1,1), 
(2,1; 1,1), (1,2; 1,1), and (1,1; 1,1), corresponding to the four combinations 
of genes transmitted by the heterozygous (dihybrid) daughter (or (4,5; 2,3), 
(4,3; 2,3), (2,5; 2,3), and (2,3; 2,3)]; and according to the idea of free assort¬ 
ment each of these four combinations should appear with the same relative 
frequency: i. But it was observed that the combined frequency" of the two 
types (1,1; 1,1) and (2,2; 1,1) was larger than that of the types (2,1; 1,1) 
and (1,2; 1,1). “The characters that went in together have come out together 
in a much liigher percentage than expected from MendePs second law, viz. the 
law of independent assortment^’ [11]. Morgan, in his theory of the gene called 
this “tendency” linkage. The idea is that the two genes 2,2 and 1,1 which 
have been together in the maternal individual tend to stay together and that 
nature has to make an effort to produce a so-called crossing-over, i.e. a separation 
of the genes “that came in together,”—such that a female of type (2,2; 1,1) 
may transmit the group 1,2 or the group 2,1. In other words, the idea of linkage 
implies an influence of the grandparents. 

According to observation the percentage of crossing over varies from 0 to 50 
per cent, i.e. from complete linkage to free assortment. It will appear however 
that in principle crossover-values greater than 50 per cent cannot be excluded. 
It was also observed that the percentage of individuals of type (1,1; 1,1) equals 
very nearly that of individuate of type (2,2; 1,1), as we would expect. In the 
same way the percentages of types (2,1; 1,1) and (1,2; 1,1) are nearly equal, 
their sum yielding the crossover-ratio. Hence the four probabilities correspond- 
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ing to the formation of the four types (1,1; 1,1), (2,2; 1,1), (2,1; 1,1), and 
(1,2; 1,1) are assumed to be (1 — c)/2, (1 — c)/2, c/2, and c/2. It is important 
to notice that these are at the same time the probabilities that the female of 
type (2,2; 1,1) (which was mated to the homozygous (1,1; 1,1), transmits the 
groups 1,1 or 2,2 or 2,1 or 1,2 respectively. 

In the general case of m characters there are crossover 

probabilities. In this case Morgan assumes linkage-groupSy each group consist¬ 
ing of Mi elements with ^ such that ‘‘there is linkage between the 

t 

elements of each group but thajt the members belonging to different linkage 
groups assort independently, in accordance with Menders second law.’' This 
idea will be reconsidered in Section 4. 

If we now wish to solve our first basic problem, i.e. to derive the distribution 
of genotypes in any later generation from an initial distribution of genotypes, 
then the concept of crossover probabilities does not suffice. The complex 
possibilities which arise if Mendel’s second Islw is no longer accepted as universally 
valid cannot be adequately described in terms of crossover probabilities. Or, 
more exactly: It will be seen that if m ^ 4 the crossover probabilities are no 
longer sufficient, w^hereas for m = 2 and m == 3 this concept is general enough. 
For the complete description of the hereditary mechanism in the general case a 
so-called linkage distributiony l.d., is needed which involves 2'"' probabilities with 
sum equal to one. Let us define this distribution. 

Consider an individual of t 3 ^pe (xi, • • • , yi, • • • , y^) ^ (-r; ^), where the 
X are the maternal genes, the genes contributed by the mother of the individual, 
and the y the paternal genes. Denote by S the set of the m numbers 1,2, • • • , 
niy by A any subset of S, and b}^ A' the complementary subset A' = S — .4. 
Denote by 1(A) the probability that the individual (.r; y) transmits the paternal genes 

belonging to A and the maternal genes belonging to A\ There are 1+ m + 

+ ••• + 1 = 2'” such subsets A and accordingly 2”* probabilities 1(A) where 

(8) Z 1(A) = 1. 

(A) 

In accordance with the pre^dously reported observations and with our assump¬ 
tion of equal conditions for both sexes one must assume that 

(80 1(A) = l(A'), 

The conditions (8) and (80 reduce the number of freely disposable values of the 
/-distribution to (2'”’"^ — 1). The /-distribution is a socalled m-dimensional or 
m-variate alternative which could also and occasionally will be denoted by 
, € 2 , • • * , €m) where €* = 0 or 1. Thus e.g. /(1,1,1,0,0,1) is the probability 
that the genes 2/i > 2/2 > 2 / 5 , 2 : 4 , X 5 ,2/6, are the genes contained in the germ cell of 
the individual (x; y). Here the set A consists of the numbers 1,2,3,6, and A' 
of 4,6. 
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Analogous to the statements (a), (b), (c) of $2 and (a0> (b0> (cO of the present 
section we may now formulate the principles of MendeTe theory of heredity under 
the assumption of a possible linkage of the genes; 

(a") With respect to m Mendelian characters an individual is characterised 
by two sets of numbers each consisting of m numbers, vis. Xi, ■ ,,Xm and V \, 


• • • , , where ’ = 1,2, • • • , r. If the type of an individual 


by (xi, • • • , , • * * , 2/m) s y) where x and y denote the maternal and 

paternal contributions respectively, then {x] y) » (y; x). Hence there are 
N\ *= +1) types of individuals. 

(b") SB (b') In the formation of a new individual each parent transmits to 
the offspring one set of m genes. 

(c") For each parent, the 2*” probabilities of transmitting any one of these 
2”* possible sets are given by a linkage distribution 1{A) where is a subset 
of the set S consisting of the m numbers 1, 2, • • • , m, and 1{A) is the prob¬ 
ability that the transmitted set consists of the paternal genes belonging to A 
and of the maternal genes belonging to A' = S — A, and /(A) = Z(A'). 


4. Some properties of the linkage distribution and of the crossover probabili¬ 
ties. In the following we shall need marginal distributions, that is partial sums, 
of the probabilities within a distribution. In a usual notation: 

= 23 2 ‘ f ^2, * • * , Xm), . 

*2 *1 *in 

ll2(Xl , 0 : 2 ) = 23 • • • 23 l(Xl , X2, * • • , Xm), . 

(9) 

^128* • wi— 1(^1 ) X 2 j * * * j Xm~l) ~ 23 X 2 f • • * , Xm), 

Z 12 X 2 , • • •, Xm) = l(xi, X 2 y • • •, Xm).the original distribution. 

These are general formulae for any discontinuous distribution. But if the dis¬ 
tribution happens to be an alternative, as the l.d., where x, takes only two values, 
any marginal distribution can be completely characterized by two subsets A 
and Ai of S where A Z) Ai. Denote by Z^(Ai) the sum of all possible linkage 
probabilities which contain all points of A\ and no point 0/ A — Ai. If, e.g. m = 8 
and A consists of 1,3,5,6 and Ai of 1,3,6 then Z^(Ai) = kvu (1,1,0,1) = 
23 l(h X 2 , 1, X 4 , 1, 0, X 7 , xg). According to the previous notation we have 

* 2 .* 4 »* 7.*8 

as usual 


. m distributions 


(™) 


distributions 


... . m distributions 


ifl(Ai) = l{Ai), or h,t .»(a:i, • • • , a:«) * Z(xi, • • • , x«) 


( 10 ) 

and 


lo(0) == 1, if A = 0 is empty. 
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We will use for the linkage distribution and their marginal distributions the 
customary notations or these new notations, whichever is more convenient.® 

As an immediate consequence of our definitions we get the following properties 
of the l.d. 

(i) If (8) holds for any A than 

(11) Ia{Ai) « IM - Ai). 

(ii) As a cmsequmce of (8) it follows {with the notation (9)) that 

(9') lid) - ^(0) - i 

(iii) If Cij denotes the c.p. between i and then 

(12) Co- = cy,- = W1,0) + Zo(0,l) = 2/o(l,0) = 2Zo(0,l). 

(iv) For any three subscripts i, k the ^^iriangulaf^ relation holds 

(13) Cij + Cjk ^ Cik 
and 

(14) Cij + Cjk + Cik ^ 2. 

To prove this consider the marginal distribution UjkixiXjXk)^ From (11) and 
(12) we conclude 

Co = 2[Zo-i.(100) + Zo-*(010)] 

Crt = 2[Zo-fc(100) + Zo-ik(OOl)] 

cy* = 2[Zo-fc(010) + lijk{00l)] 

1 = 2[lijk{m) + Zo-fc(lOO) + Zo-;t(010) + Zo-ifc(OOl)]. 


• It is easy to indicate experiments which should furnish the relative frequencies corre¬ 
sponding to the l.d.: If a homozygous female (xi, • • •, Xi, • • •, Xm) is mated to a homo¬ 
zygous male (yi, • • •, 2/i, • • •, 2/m) where each Xi yt , the resulting offsprings will all 

be of type (xi, • • •, Xm; 2/i. ■ * •» Vn)- If such an offspring is back crossed to (t/i, • • • , y„; 
2/i > * • • , 2/m) there will be 2’" different genotypes of offsprings, viz. (xi , X 2 , • • • , x„; yi , 
2 / 2 , • • •, 2/m), (2/1 , , • • •, Xm ; 2/1 , 2/2, • • •, 2/m), ctc. whosc frequencies are proportional 

to the 2 ”* values of the l.d., viz. to Z(0, 0 , • • • , 0 ), Z(l, 0 , 0 , • • • , 0 ) etc. Such an experiment 
should give the same results for any two sets of x's and y’s. (There is, of course, the statis¬ 
tical problem how to determine the “best^^ values of the l.p. from these observations.) 
In an analogous way a marginal distribution can be observed: Suppose we wish for m 5, 
the Im (<i , « 2 , € 3 ). The offspring of a cross between females (xi , X 2 , Xs, Xi, x&; Xi, Xz, xs, 
X 4 , Xi) and males ( 2/1 , 2 / 2 , 2/*, , * 5 ; 2/i . 2 / 2 , y*, ^ 4 , Xi) are of type (xi, X 2 , Xs, X 4 , x^ ; 
2/1 , 2 / 2 , y*, ^ 4 , Xi). If they are crossed to (xi, • • • , Xs ; Xi, • • • , Xe) there will be eight 
different types of offsprings proportional to the eight values of Im («i , cs, es)- In this lajst 
setup the y** should be dominant and in the experiment, described above^, the y* should be 
recessive in order to be able to distinguish between the phenotypes of the individuals. 



PROBABILITY THEORY OF LINlLiGE 


35 


Solving these equations with respect to the Z-values we get 

lijkilOO) = } (Cij + Cik — Cjk) 

(15) /«jJt(010) “ i (C,y + Cjk — Cik) 

lijk(001) = J (Cik + Cyk — c,*y) 

(16) liMOOO) = i (2 ~ ~ - cy*). 

Thence (13) and (14) follow. The condition (14) is of course always fulfilled 
if Cij ^ j, but this restriction does not seem to be necessary. From (15) and 

(16) we deduce: 

(v) // m = 3, the set of three c.p, , Cia, Cm for which the inequalities (13), 
(14) hold is equivalent to the Id. l(xi , X 2 , x^ for which (8) holds. For m ^ 4 the 
c.p. are no longer equivalent to the l.d. Another necessary condition for the 
c.p. will be derived in section 8. 

Now let us consider and characterize some important particular cases of the l.d. 

(i) Free assortment (Mendel). In this case all 2"* values of the l.d are equal 
and therefore equal to (i)*”. 

(ii) Complete linkage (reported by Morgan and other authors). In terms 
of the l.d. this means 

(17) Z(1,1, ... , 1,1) = Z(0,0, . •. , 0,0) = I or ls(S) = i. 

Consequently, all other values of the l.d. are zero. It follows that all c.p. are 
zero because all Z,y (1,0) are zero. (See also Theorem I, section 7.) 

(iii) Linkage groups (Morgan). In terms of the l.d. this means that the l.d. 
resolves into a product of several distributions,.e.g. 

(18) l(xi , X 2 , • • • , == f(xi , X 2 )g(xs , X 4 , xi)h{xfi, x?, X8, xg). 

(There is no loss of generality in assuming that numerically consecutive charac¬ 
ters form a linkage group.) As /, g, and h are distributions it follows with nota¬ 
tion (9) that “within^’ the groups: 

Cl 2 = 2/(10), Cs4 = 2^84(10), • • • , C46 = 2^46(10), 

C67 = 2^67(10), • • • , C89 = 2/l8»(10). 

these crossover values are auite arbitrary. On the other hand we have because 
of (90 

fid) = fm - gjd) = ^y(o) = hkd) = hk(o) - i, 

a = 1, 2;i - 1,2,3;* - 1, •••4) 

Hence for the c.p. *‘among” the groups 

Ci8 = 2-J-J = §, etc. Hence Cw = cw = Cm = •••== Cw = i 

in exact accordance with Morgan^s idea of linkage groups. If each group con¬ 
sists of only one element: l(xi , X 2 , • * • , Xm) = f(xi)gix 2 ) • • • k(Xm) it follows 
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that/(xi) ^ g{x%) = • • • « fc(xm) - i for x< = 0, 1, hence l{xi , - • , Xm) ^ (i)"* 
for all combinations of the arguments and we have again free assortment. 

{iv) Groups of completely linked characters. Combining and generalizing the 
ideas .of (ii) and (iii) we may speak of i groups of completely linked characters if 
within such a group no crossover takes place. Then the mi characters in each 
group act as one character. An example will suffice. Suppose m = 9 and three 
such groups, consisting of the characters 1,2, and 3,4,5, and 6,7,8,9 respec¬ 
tively. Assume that 

z(ii. 111, nil) = z(oo,000,0000) = a, z(oo. 111, nil) = z(n,ooo,oooo) = Ci 
z(n,ooo,nn) = z(oo, 111,0000) = C2, z(n, 111,0000) = z(oo,oo, 1111) = Cs 

where these four numbers are 5^0 and with sum hence all other probabilities 
are zero. It follows that the c.p. ‘Vithin” the groups are all zero: Cn = 034 = 

• • * = C 46 == C 67 = * • • = C 89 = 0, but the '‘among'’ c.p. are different from zero, 
e.g. Ci 3 = Ci4 = C 16 = C 23 = C 24 = C 26 = 2 Ci + 2 C 2 and, with an obvious notation: 
CijT = 2(Ci + C2), Cijrr = 2(Ci + C3), Cnjji == 2(C2 + Cs). 

A particular case (also a particular case of (iii)) arises if the l.d. resolves into a 
product of some distributions such that there is complete linkage in each of these. 
The "within" crossovers are then again zero but all the c.p. "among" the groups 
equal 


6, The case m = 2. It will be easier for the reader if this case, though it 
has been investigated before by several authors [16], [7], [14], will be presented 
by means of explicit computations before attempting the general one where m 
and r are arbitrary. 

If m == r = 2, the number of types (xi, X 2 ; yi, 2 / 2 ) equals ten. The l.d. is 
complete!}'' determined by the c.p. C 12 = c and v.v., because Z(10) = Z(01) = c/2, 
Z(00) = Z(ll) = (1 — c)/2. Now let , X 2 ) be the probability that in the 

nth generation a male (or female) individual transmits the genes xi, X 2 ; and de¬ 
note by pi''\xi) and P 2 '\x 2 ) the respective marginal distributions. The formula 
corresponding to (4) then becomes 

p<">(l,l) = 1,1) + y’d,!; 1,2) + |t;'">(l,1; 2,1) 

+ ^■-%'"'(1,1;2,2) + |r<">(l,2;2,l), 

and three analogous formulae. To understand this, consider e.g. the last term 
of (19); it is the probability that an individual be of type (1,2; 2,1) or (2,1; 1,2) 
and transmits the set (1,1). By (19) p'"'(a:i, xt) is deduced from the given 
distribution d'”’ of genotypes. 

If, as before, x and y are written for Xi , Xt and yi, y% it is to be understood that 
X = y means Xi = j/i and Xi — yt. The relation corresponding to ( 5 ') takes 
then the form 


(20) 


y) = p‘"'(x)p‘"'(x) if x = y 

= 2p‘"'(x)p‘"’(p) if X 7 ^ y. 
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Applying (19) to the (n + l)8t generation and vising (20) we get the recuircaide 
formula 

(21) “ b'"’(l,l)]* + p‘”>(l,l)p<’‘’(l,2) + p'’*»(l,l)p‘"'(2,l) 

+ (1 - c)p‘">(l,l)p‘"’(2,2) + cp<’*»(l,2)pf*’(2,l). 

Here the right side can be rewritten so as to give 

(22) p^'-'^d,!) = (1 - c)p‘">(l,l) + cpi">(l)pi">(l) 
and three analogous formulae. Because of (7): 

(22') p‘'"*'”(xi ,Xi) - {1- c)p‘’”(xx, X 2 ) + cpi®’(a5i)p»®^(x*). 

From this recurrence formula, which has the particularly simple property that 
the second term on the right side is independent of n, it is easy to derive step 
by step: 

(23) p'">(x,, X*) = (1 - c)y'«(xi, xs) + [!-(!- c)"]pi®>(x,)p^®(x,). 
Hence, if c 5 >^ 0: 

(24) lim p^"'(xi,xs) = pi^^xOpi^^xt). 

n-*« 

The preceding results were obtained by Robbins and Jennings. We will formu¬ 
late a theorem after having studied the general case of arbitrary m and rj 


6. The general recurrence formula. Considering random mating and assum¬ 
ing general linkage, we now wish to find the relations which correspond to the 
formulae (19)-’(22) in the case of m r-valued characters. It will turn out, that, 
by using the l.d., the following proof of the general case becomes surprisingly 
simple compared with older investigations of the particular case of free assort¬ 
ment, the values of the l.d. acting somehow as natural “separators^' for certain 
groups of terms. 

Denote by , * * * , Xw, ; , • ♦ • , 2/m) = y) the probability of a 

genotype whose maternal genes are the x and whose paternal genes the y. Then 
from (a"): 

(25) w^''\x] y) = w^''\y] x). 

Writing X ^ y\{ and only if = y,-, (f = 1, • • • , m) we put just as in (2) 

t)‘"’(x; y) = «)‘"’(x; x), if x = p 

(25^) 

= u;^'”(x; y) + w'"’(p; x) = 2«)‘’’’(x; y), \tx 9 ^ y. 


’’ A suggestive remark, repeatedly made by Professor S. Wright states that (assuming 
random mating) there can be no equilibrium until all of the factors are combined at ran¬ 
dom. This is indeed a necessary condition for stability. 
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There are r®" i/i-values and + 1) j>-values in each generation the respective 

sums being always equal to one. Denote by , • • • , x„) the probability 

that a male (female) individual of the nth generation transmits the genes x, and by 

Pif(Xi, X,), • • • , pii}..„(xi, , Xm) = p‘”’(a:i , ■ ■ ■ , x„) 


the corresponding marginal distributions, defined as usual (see (9)). Sometimes 
it will be convenient to denote such a marginal distribution by Pa{za) ~ Pa{z) 
where -4 C <S, and Pa(z) is the sum of all p(x) such that Xi = Zi for all i e A, 
Following convention the subscript will be omitted if ^ = S; hence ps(z) = p(z) 
and if A is empty, ^ = 0, the corresponding po(z) = 1. 

To simplify the writing p(r), v(x; y), etc. will be written instead of 

y), etc. and p'(r), v'(x; y), etc. for etc. Finally, remember that 

1(A) is the probability that the pat-ernal genes of A and the maternal genes of 
A' — S -- A will be transmitted and accordingly Ia(Ai) is the (marginal) prob¬ 
ability that the paternal genes of Ai and the maternal genes oi A — will be 
transmitted. (S A "D A^), 

Let us derive p^(z) from p(z). From the meaning of the different distributions 
we gather that ^ 


(26) 


p(z) = Xl(A)w(x; y) 


where A is an arbitrary subset of S and x and y such that 


yi = Zi for i e A 

Xi = Zi ‘‘ i € A'. 


In fact, the set z will be transmitted if and only if an individual possesses these 
genes and also transmits them; now consider any 1(A) i.e. the probability to 
transmit the paternal genes of A; this probability is to be multiplied by all 
possible ic-probabilities which contain as arguments the paternal genes of A 
and the maternal genes of A\ as stated in (a). Now let us write (26) also for 
the (n + l)st generation: 


(26') p'(z) = mA)w'(x; y). 

Next we have, just as always, [see (5), (20)] 

< 

(27) w'(x] y) = w'(y\ x) = p(x)p(y). 

Hence from (26') and (27) follows 

(28) p\z) = n(A)p(x)p(y) 


with the condition of summation given by (a). 

The right side of (28) contains (2r)'” terms. Now we will write it in two 
different ways by collecting its terms under two different aspects: (i) arranged 
according to the marginal values of the /-distribution (ii) arranged according to 
the marginal values of the p-distribution. Let us begin with (i). 
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The genes 21 ,^ 2 , • , can be transmitted only by individuals which possess 

each Zi either before or after the semicolon or both; (either from the mother or 
from the father or from both parents). Hence, if At and A 2 are two disjoint 
subsets of S, the type of such an individual is such that 

Xi 3 ^ Zi , yi = Zi for all i € Ai 

(b) Xj-==Zj, yjr^Zj ‘‘ “ j€A 2 

Xk - yk - Zk keS - Ai - At. 

Hence the paternal genes of Ai and the maternal genes of At must be transmitted 
and for the remaining genes either choice is admissible. Consequently, each 
w'{x; y) in (26')—or, what is the same, each p(x)p(y) in (28 )—m multiplied by 
the probability that the paternal genes of Ai and the maternal genes of A 2 are 
transmitted. Now writing Ai + At — A this last probability is exactly the 
marginal probability Ia(Ai) = hiAt). Thence 

(29) p'(z) = lp(x)p(y)lAiAi) 

where the sum is extended over all pairs Xy y defined by (b). This is a first 
recurrence formula. If in (29) ty'(x; y) is ^Titten instead of p{x)p{y) and then 
all accents are omitted we get 

(30) p{z) * 'Zw{x\ y)lA(Ai) 

with the summation according to (b). This formula is necessary in order to 
derive p(z) from the given distribution w(x; y) of genotypes. It corresponds 
to (19). 

Now let us collect the terms of (28) in the second way. Let us determine 
the factor of any 1{A) in (28), e.g. of i(l,l,0,0,0) (where m = 5 and A the sub¬ 
set 1,2). Any factor of Z(1,1,0,O,O) must be of the form p{zi, Zt, •,•,•) 
p( -, • ,«8, Z 4 y Zb) where all possible values of the variables must be written on 
the empty places marked by points, and the sum of all these products is to be 
taken. Now, as in each of the two p’s on each of the free places all numbers 
between 1 and r have to be used, the sum of all tiiese products resolves into the 
product of the respective sums of the p’s. In such a sum each term, on the 
places belonging to A contains the same fixed values Za and on the other places 
any possible value combination; hence such a sum is precisely the marginal 
probability Pa(za) = Pa(z) and the same holds for the other sum of the p’s and 
for A' = S — A. Thus we get the second, even more important recurrence 
formula 

(31) p'(z) = Y.liA)pA{z)pAz) 

where the sum is over all subsets A of S. This formula corresponds to (22) 
and the limit theorem which will be proved in the next section is an almost 
immediate consequence of (31). It is worth noticing that the derivations of 
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(31) and (29) from (28) are completely independent of each other and that only 

(31) is needed for the limit theorem 

From (29) and (31) the. interesting identity follows 

(32) Y, li.A.)pA{z)pA'{z) = Sp(a:)p( 2 /)i-.(^i) 

<-») 

which somehow reminds us of a general Abel-transformation. 

, Let us summarize: (i) From a given distribution of genotypes (or «'“’) 

the p*"’ are derived by (30). (ii) From these p‘"* the w*"'*'*^ follow by (27) 
(or by (25') and (27)). (iii) Instead of step (ii), from p'"^ the consecutive 
p(n+i)^ p(n+ 2 )^ ^ may be derived directly by means of (31). Finally, 

if desired, follows by (27). 

As an illustration of these formulae let us write (31) for m = 3,4,6: 

p'(x, ,x^,x») = 2 (i( 000 )p(xi , Xi, xt) + l( 100 )pi®’(xi)p»(a:j, x») 

(31') + l(010)pr’(xi)p«(*i, x.) 

+ i( 001 )p»“(xi)pis(xi, xj)] 

p'(xi, X*, xj, xi) = 2[l(0000)p(xi, X 2 , x», X 4 ) 

+ l(1000)p{“’(Xi)p284(X2 , Xs , X 4 ) + • • • 

(31") + ((1100)pi2(Xi , X2)P34(X3 , X 4 ) 

+ Z(1010)pij(Xi , X»)P24(X2 , X 4 ) 

+ i(1001)pi4(Xi , X4)p2s(X2 , X 2 )] 

p'(xi, X 2 , Xs, X 4 , Xt) = 2[f(00000)p(xi, • • • , Xj) 

(31'") + l(10000)pl“’(Xl)P2M6(X2 , Xs, X4, Xs) + ••• 

+ f(11000)pi2(Xi , X2)P846(X8 , X 4 , Xs) + ••■]. 

In the last formula the last group contains ten terms. As an illustration of 
(30) we write e.g. for wi = 3, r = 2, with p'"' = p and v'"^ — v: 

p(xi, X 2 , Xs) = t>(xi, X 2 , Xs ; Xi, X 2 , Xs) + Mi’(xi, X 2 , Xs J Pi, Xs, Xs) 

+ t;(xi, Xs, Xs Xi, ps, Xs) + • • •] 

+ [li2(00)t)(xi, Xs, Xs; Pi, Ps, Xs) 

(30') + lis(00)t;(xi, Xs, Xs; Pi, xs, ps) + • • ■] 

+ Z(000)»(xi, Xs, Xs; Pi, Ps, Ps) 

+ ll(100)t)(pi, Xs, Xs; Xi, Ps, Ps) 

+ l(010)t>(xi, Ps, Xs; pi, Xs, Ps) + • • •]• 
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, Xm) we write the recur- 


7. Limit theorems. In order to find lim , 

n-*co 

rence formula (31) in the form 

(33) p<"+“(x) - 21(00 •••0)p‘">(x) * E'^(^)pi"'(*)pi*’(«)- 

(A) 

Here means a sum over all subsets A of S which are neither void nor equal 


(A) 


to S, If we write for the right side of (33) and 
21(0, • • • , 0) » am the last equation takes the form 

(34) 


Pm , 


- amPm^ = 


Consider first the case am = 1 , or Z(0, • • • , 0) = Z(l, • * * , 1) — i, i.e. compleU 
linkage^ as defined in section 3. In this case all Z(A)-value 8 on the right side 
of (33) are zero, hence = 0 and 

(35) pi"-^“=pl’” (n - 0, 1, 2, •••). 

This is exactly the same result as (7): All pl“’ are equal to pi®’ and because of 
(27) also 

(36) w'"*(a:;p) = w^^\x;y) or v'”\x-,y) = r*'’(x;p) (w == 1, 2, •••). 

In fact, if the characters are completely linked, they act as one character. Hence 
we have 

Theorem I . If the m Meridelian characters are completely linked, the distribution 
of genotypes reaches the stationary state in the first filial generation. 

Now consider (34) in the general case where 0 g am < 1. Then the following 
lemma will be used: If in a recurrence formula of the form (34), | am | <1 and 
lim = gm exists, then lim pll'^ = Pm = qm/{l — otm). This can be proved 

n—*00 n—*00 

directly in various simple ways. It may also be regarded as a consequence of 
well-known general convergence theorems. See also [15]. 

In order to apply the lemma let us first notice that q^ exists. In fact, 

, X 2 ) — 2 /( 00 )p^”^(xi, x-y) = 2 l( 0 l)pi^\xi)p 2 ^\x 2 ) and as the right side 
is independent of n, 52 certainly exists. Hence, it follows from the lemma that 
P 2 exists. For m = 3 the recurrence formula (31') shows that q^^^ contains no 
marginal distribution of p of an order higher than two; therefore each of the 
terms of gs”' approaches a limit, hence qz = lim qz""^ exists, and consequently, 

because of the lemma, pz exists. We may continue in this way because in (33) 
all marginal distributions of p on the right side are of an order g m — 1 . Hence 
for every m the qi!"^ approaches a limit and consequently the lim exists. 

n—*oo 

Finally, in order to find p„ we notice that g 2 = (1 — ai)pi°\xi)p 3 '’^(xt), hence 
Pi = Px“’(a;i)p 2 ®’(a;s). Then, assuming that pm_i = pi“'(xi) ••• prii(a:«_i) 
we see from (33), using (8), that g™ = (1 - a„)pi°\xi) • • • Pm’(a:«). (See 
also (310 (31"), (31'")-) Thence 

(37) lim p‘"’(xi, X 2 , • • • , a;*) = p{®(a!i)P 2 “*(a: 2 ) • • • Pm\xm). 
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The last formula contains the limit theorem we wished to prove. It can be 
stated as follows: 

Theorem II. If m characters are arbitrarily linked, with the one exception of 
'^complete linkage'\ the distribution of transmitted genes , • * • , Xm) ^'con¬ 

verges towards independence '" The limit distribution is the product of the m 
marginal distributions of the first order Pi^\xi), which are derived from 
p^^\xi , • • • , Xm), the distribution of gametes in the initial generation. 

If, however, the initial distribution p^^\xi , • • • , shows particular features, 
the stationary state may be reached already for a finite value of n. This happens 
with w = 0 and for every l.d. if p^^\xi , • • • , Xm) = * • * Pm\xm)- In 

other particular cases it may happen under particular assumptions for the l.d. 

Let us express the general result also in terms of the distribution of geno¬ 
types. It follows from (37) and (27) that 

lim y) = lim p^”^(x)p^”\y) 

n-^ce n-*oo 

= Pi‘\xi) ••• Pm\x„)pi^\yi) ••• = n 

iiml 

Now consider a product like Pi^\xi)pi^\yi), By definition of pi°^(xi) and ap¬ 
plying (27) we find 

Pi®(a:i)p)[®'(j/i) = Z • • • 23• • •.23 • • • 23p"”(^i, ,ym) 

*2 *m */2 Vm 

= 23 23 p‘“’(a:)p‘®’(p) 

Introducing then in a natural way the marginal distribution: 


; y,) 


(38) 






; 2/1 > 


> 2/m) 


it is seen that 

(39) Pi‘\xi)pi‘^(yi) = wTiXi ; j/,). 


Thence the result 

(40) lim«)‘"’(a:i , ■ • • , x„ ;yi, ■ ■ • ,y„,) = Wi\xi ;yi) • • • Wm\x„ ; y„) 

n--»oo 


which may be stated as follows: 

Theorem III. In case of m arbitrarily linked Mendelian characters the distri¬ 
bution of the genotypes in the nth generation, , • • • , Xm ; , • • • , 2/m), 

''approaches independence" as n ^, The limit distribution is the product of 
the m marginal distributions wf^{xi; yi) of the ith character {i = 1, • • • , m) 
in the first filial generation. 

This theorem, which may be regarded as a corollary to Theorem II, holds for 
any type of linkage, except ‘‘complete linkage’' as defined in (17) where (36) 
is valid. 
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8 * Solution of the recurrotice equations (31). Formula (31) expresses 
, • • • , Xm) in terms of , • • • , Xm) (and all marginal distributions 

of and of the l.d. It seems desirable to try to express in terms 

of v^^\x). Now (31) is not a single equation but rather a complex system of 
difference equations with constant coefficients because for each marginal distribu-* 
tion of order i < m the respective recurrence formula (31) of order i has to be 
used. (Or, if it is preferred to consider the marginal distributions as sums of 
p-values of order m, then all these p-values appear simultaneously and there is 
again a complicated system of difference equations.) In this situation it is not 
to be expected that the integration will yield simple explicit formulae, partic¬ 
ularly as long as the l.d. is left arbitrary. However, the construction of the 
following formulae is clear. They reduce to simpler expressions in particular cases. 

Let us use a method of indeterminate coefficients. To simplify the writing 
denote , * • • , x^) and its marginal distributions Pi°^(3?t), , x,)^ etc. 

by pi 2 .m , Vi > Vi] i From genetical as well as mathematical considerations 

we gather the general form of piJA .m in terms of and its marginal distribu¬ 
tions; that this is indeed the general form will be verified by our very computa¬ 
tions. Consider the set S consisting of the m numbers 1 , 2 , • • • m and divide 

5 in every possible way in two disjoint parts A\ and , none of them 

being empty, so that Ai + Ao ^ S, then divide S in every possible way into 
three disjoint parts so that A^ -f ^2 + A 3 — S, and finally S is divided into m 
disjoint parts each consisting of one single element. Denoting the unknown 
coefficients in a corresponding way by ^ 2.^8 > writing 

and Vs for Vn-^ m and ViV--m the general form of Vs will be 

OtS Vs *4“ 2 p.4 j p.42 ”i” 

f.4l) (•41.-4 2) 

+ * * * + afi,2?3.- -,mPlP2P8 ••• Pm. 

This holds for every m. We get e.g. for m == 4 

P1234 == OLiUiVvm A" (0fl.234PlP234 + oi^lZiV^Vl^ + * ’ •) 

(41') • ^ ^ . . 

+ (ai2"34Pl2P34 + •••) + («12,3.4Pl2P3p4 + '•*) + afi,2,3,4plP2P8P4 • 

For m = 6, e.g., there are eleven different types: One term ai^AePi.-.c; then 

6 terms of the form a{,?1..6PiP25...6 ; 15 terms like ai2,3466Pi2P8466 ; 10 terms like 
ai2”3.466Pl23P466 ; 15 tCrmS like ai.2!3456PlP2P8486 ; 60 termS like ai.?3.456PlP28p4M ; 
15 terms like ot{ 2 M,wVviVz^V^ ; 20 terms as aC 2 ,z,mViV^V^V^^ i 15 terms as 
ai2”34,6.6Pi2Ps4P6P6 I 15 tcmis as ai,2,3.4.86PiP2P8P4P66 ; ond onc final term 

ai,2?8,4,6,6Plp2P3P4P6P6 • 

In (41) the a!”? are unknown constants depending on n and on the l.d. 
In order to find them consider (31) and write for the values of the l.d. Dh instead 
of 21{A) (no confusion is passible because no marginal distribution of the l.d. 
occurs in (31)). With this notation (31") e.g. reads: 

(31") piJaV^ = I'oPim + {vXvxVnl + **•) + (t^i2Pi2*V3r^ + *•*)• 


(41) 
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If there is no ambiguity the upper miavA may even be omitted. Now assume 
the equations (41) to be written for = 2, /i = 3, • * • , m “ Introduce into 
the left side of (31) the expression (41) for and in the same way replace on 
the right side of (31) all pif, • • •, by their respective expressions 

(41). In this way an equality is obtained from which recurrence formulae for 
the unknown coefficients may be deduced by collecting all groups of terms which 
contain the same products of p’s. 

If this is carried out, e.g. for m == 4, the recurrence formulae are 


ai284 — Voai2U 


(n+l) _ ,, , „^(n) 

«128,4 — t;oai2s.4 “T t;4ai28 


(42) 0!i 2!?4^^ ~ i^0OJi?,84 “I" etC. 

4" + *^8^12,4 + t^4ai2,8 

Ofuia.V = *^0^1.2,V4 + + * * * + t^i2ai,2^al4^ + * • * 


In general, i.e. for any m, these recurrence formulae are of a clear structure the 
first one being particularly simple, namely 


(43) 


Ctg — VoCHs • 


It can be solved immediately and gives 
(430 “ fo" • 


The other recurrence formulae are aU of the form 


(44) a:„+i = VoX„ + f(n) with xo = 0, 

where f(n) is a given function of n whose general form is still to be investigated. 
The solution of (44) is 

(440 s Ef(»)vo-'-'. 

P-0 


With the notations used in (41) the equation (44) may be written: 


(44") 


^(n+l) _ , j(n) 

V0«4i,A2. *-.A,. “T -^Ai,A2. -.a 


(n) 


We have to determine A a,, a a^ . For reasons of symmetry and homogeneity 

let us introduce constants = ... = — 1. With that notation 

e.g. the last term in the second line in (42) reads or the third term to 

the right in the fourth line of (42): etc. 

The construction of .may then be described as follows: Each 

AAi,Ai .A„ is a sum of — 1 terms, each term being a product of one y-value 

and two a’s. The set consisting of the /x elements , .42, • • • , is to be 
divided in all possible ways into two non-empty, disjoint, complementary parts 
which form the subscripts of the two a’s in question; the subscript of v is equal 
to the subscript of either of these two a-values; it makes no difference which, 



PROBiLBILmr THBORT QF UNKAOS 


45 


because of the specific symmetry (S') of the l.d.; it should be noted iliat in 
subscripts of v no comma occurs. As an example let us write for m 8. 

We get: Ai2t4,HT.8 =* t'8«8"*«u»V*67 + Vfgtain aiatV8 + vauotiiuoinA. Or if we wish 
A 18,84.8.8 for m = 


+ ru«i8 ^<88^8.8 + 


VtfXt”^C 

113,84,8.8 *= <>808"*<*18.84,8 + r806"*0i8,84,8 
084084 *018,^8 + 088<*8,8*Ol^M + PUfOu.sOMls + 0i38Oi2,8O8^8 

Hence, in principle our “integration’ 

completely solved: First pi"' is given by (41). TTieii^ in order to find any 

. we first determine* the corresponding Aa^I . . by the rule just 

explained and illustrated, and then it follows from (44') that 


problem, wh6^ n is the variable, is 


(44'") 


Oili.iij,. 


••An 


Vo 


n-l-F^(F) 


Ai.A 3 ,-‘*.A^ 


This whole procedure, although in principle very simple, may of course be lengthy 
if m is not small and if no specific assumption for the l.d. is considered; for in the 

expression of Aa^.az a^ many different a-values appear,—each however with less 

than m subscripts—^which play the role of abbreviatidns for complicated ex¬ 
pressions; in other words the explicit solution for m = 6, for instance requires 
the solutions for m < 6, all these solutions being however completely given by 
our formulae, down to m = 2, where and are given by (23). 

Under simple assumptions for the l.d. the explicit expressions for the a become 
simple. Two extreme cases are complete linkage and free assortment. In 
the first case pi^A .m == PiV- ^m and nothing remains to be done. The case of 
free assortment where all v = can be dealt with directly by induction, or 

we may evaluate the general formulae given above which in this case become 
quite simple. We have* 

(45) .A, = 2"(2" ~ 1) • • • (2" - m + 1). 

That shows that the values of the coefficients a?!*? depend only on the number of 
elements ^4, which appear as subscripts. Thus we find e.g. for ?n = 6, if we 
write in each line of (45') one typical value: 


= 272 *^ = 1/2*” 

Ofl.28466 = «12.84M = ai28,486 = (2” — l)/2*” 

«1.2?3466 — 011.28.456 = 0(i2*,84,66 = (2” — 1)(2” — 2)/2*” 
a{.?.V466 - - (2” - 1)(2” - 2)(2" -- 3)/2*” 

a{.?.Vi.56 = (2” - 1)(2” 2) . • • (2” - 4)/2*” 

a}:2?8,4.5.6 - 4 (2" - 1)(2” - 2) • • • (2” - 5)/2*” . 


Thus in the simple case of independent assortment the explicit solution is very 
simple too. It confirms the fact that lim ai.2,V4,s.8 = 1 while all other a’s approach 


® The values on the right side of (46) are indicated in [1]; but the solution for free assort¬ 
ment reported in this article does not seem to coincide with ours. 
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zero. To prove this, however, without recurring to computations, was the 
purpose of the preceding section. 

9. Crossover distribution and crossover probabilities. The limit theorem of 
§7 as well as the computations of the preceding section, in short, all investiga¬ 
tions and concepts considered so far, are valid for any l.d. We shall now define 
and use a crossover distribiUion, (c.d.), which is completely equivalent to the l.d. 
but preferable for the study of certain particular cases. Apparently biologists 
have not considered the general concept of the c.d. but only the c.p. c„ . This 
concept is basic and tangible but not sufficient for a complete description of the 
linkage mechanism when m ^ 4, as was seen in the preceding sections. 

It is obvious that, from our point of view, a mathematical theory of linkage 
must be based on the properties of and a set of assumptions on the l.d., or the c.d. 
The linear theory will be considered from this standpoint. This theory is, of 
course, still compatible with a variety of particular assumptions. In the last 
section some simple particular cases will be presented and studied with a special 
view to interference. 

The probability that an individual transmits the set of “paternal genes’^ be¬ 
longing to A and the set of “maternal genes’’ belonging to A' = S — A is denoted 
by i(A), where 1(A) = l(A'); e.g. with m = 8 : i(l ,0,1,1,1,0,0.1) = /(0,1,0,0, 
0 , 1 , 1 ,0). Considering here the succession of arguments we see that in either set 
of eight arguments: The first and the second are from different sets, the second 
and the third are again from different sets, the third and the fourth are from the 
same set, * • • the seventh and eighth are from different sets. Writing 0 for 
“same” and 1 for “different” and using these numbers to correspond to the 
(m ~ 1 ) consecutive intervals between the m genes, we introduce: 

Z(10111001) + KOlOOOllO) = 7 r( 1100101 ). 

Here 7 r(r 7 i, , • • * , Vm--i) where 77 , = 0 or 1 , is an (m ~ l)-variate alternative. 

The relation between the l.d. and this new^ distribution may be written in the 
form 

(46) 2l(€i , € 2 , • • • , €„*) = €1 — €2 I, I €2 — C31, • • • I €m-i — €,« |), = 0 or 1. 

In this definition no fixed “order” of the genes is implied so far. The numbers 
1,2 • • • m are used like names. 

But it seems to be admitted today by leading biologists that a certain natural 
order of the genes exists. If this is so the numbers 1,2, • • • m should be used 
in agreement with this order. Let us note, however, that the situation is in 
reality slightly different: Only the genes within each linkage group (§4) are assumed 
to be ordered, whereas no order exists aniong the groups. Let us for the moment 
disregard this circumstance and assume that all genes under consideration 
belong to the same linkage group. 

Within such a linkage group a one-dimensional or linear order prevails, to be 
understood in the geometric sense of “location”. Some more precise definitions 
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concerning this linear order will be considered later. For the mom^t we mmply 
imagine that each of the two sets of genes belonging to an individual is arranged 
like m consecutive discrete points on a lifte segment.® The crossover distribution 
r(rfi , T/ 2 , • * • Vrn^Of introduced in (46) becomes more meaningful under this assump¬ 
tion where, now, the numbering corresponds to this linear order. Then the 
argument 0 in this distribution can be interpreted as “coherence*^ and the argu¬ 
ment 1 as ^'interchange'^ or "crossing over" and the "intervals" as intervals in 
the geometric sense. Whether this "crossing over", which means transition 
from the maternal to the paternal set or vice versa, is to be conceived as a 
"break" (Janssen's chiasmatypie) does not matter for the above definitions. 
If however, the idea is that between two neighboring genes not more than one 
break is possible then the "event," which we call crossover, would be at the 
same time a break; if, biologically, more than one break between t and (t + 1) 
is not excluded, then the event "crossover within (t, t + 1)" means "odd num¬ 
ber of breaks within this interval." 

Now, let us consider the relation between the c.d. and the c.p. It has been 
repeatedly remarked that the c.p. are not equivalent to the l.d., hence they are 
not equivalent to the c.d. either. There are § m(m — 1) c.p. but — 1 
i-values, or ^-values. If m ^ 4 the second number is greater than the first. 
Besides, the i-values are absolutely arbitrary probabilities. For the c.p. in sec¬ 
tion 4 some restrictions were derived. Let us derive another set of restrictions 
by considering four numbers t, j, fc, I which we may denote by 1, 2, 3, 4. (The 
following computation has nothing to do with linear order. It applies t/ m = 4 
to the l.d. i(ei€ 2 € 8 € 4 ) and if ?n > 4 to the respective four-dimensional marginal 
distributions of the l.d.) Write t;(€i€ 2 € 8 € 4 ) = 2J(€ie2€3€4) and let us add up the 
six c.p. corresponding to these four numbers. From c,y = 2lij{l\0) = t;<,(l,0) 
we get 

c,2 + ci3 + • • • -f C34 = 3i^(1000) + 3t;(0100) + 3r((X)10) + 3t;(0001) 

(47) + 4K1001) + 4r(1010) + 4i;(1100) 

= 4 - 4t;(0000) - f;(1000) - v(OlOO) - t;(0010) - t;(0001) g 4. 
Hence as by (14) Ci 2 + C 23 + Ciz ^ 2, it follows that 
(14') + Cyi + ck S 2 

is another necessary condition for the c.p. The limit "2" can be reached, as we 
see for v(OOOO) = v(lOOO) = i;(0100) = t;(0010) = t;(0001) = 0; then 

C 12 = C84 = v(lOOl) + KlOlO) 

C23 = ci4 == t;(1100) + t;(I010) 

_CIS = C 24 = t;(1001) + r(llOO) 

® ‘'The genes are represented as lying in a line like beads on a string. The numerical data 
from crossing over show in fact that this arrangement is the only one that is consistent with 
the results obtained*’ [11]. This is but one of many statements in favor of the linear theory. 
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and 

Ci2 + C28 + Ciz *= Ci 4 + C24 + C84 *= 2. 

To summarize the facts about the c.p.: In case of m cha/racters there are 
\m{m — 1) c.p. Cij = 2lij(l0) = 2?<y (0,1). These values must satisfy (he follow¬ 
ing necessary conditions (besides 0 g g 1): 


(13) 

Cij + Cjk ^ Cik 

for any three subscripts 

(14) 

Cij + Cjk -h Cik S 2 

« « « t( 

(140 

Cii + Cji + ^ 2 

“ “ four “ 


If in an analogous way five or more subscripts are considered no new condition 
turns up. It has, however, not been proved that the above given necessary 
conditions are sufficient for a consistent system of c.p. If we wish to be sure of 
consistency the starting point must be a l.d. or a c.d. from which the c,y are 
deduced. 

[This question of consistency belongs in the same class as the following problem: 
“Under w^hat conditions does a set of distributions F,v(x,-, Xj) form the mar¬ 
ginal distributions of second order of an m-dimensional distribution F(xi, • • • , 
Xfn)V* Here V{xi^ • • • , Xm) is the probability that the first result is ^ Xi, the 

second ^ x ^, the last ^ x^ . An analogous question arises for the set of 

distributions Vijk(xiy Xj, Xk)t etc.^°] 

In the following it will be necessary to know the expressions of the c.p, in terms 
of the c.d. Put m — 1 = /i and denote by p,, pa , etc. in the usual w ay the 
follow^ing probabilities derived from the c.d.: pi is the probability of “success'' 
in the t-th trial, pij the probability of success in both the f-th and j-th trial, etc. 
It has to be kept in mind that for the c.d. and all magnitudes derived from it the 
“i-th trial" is associated with the t-th interval, i.e. with the interval (?', i + 1) 
“and success in the f-th trial" means cross over in this interval. [Whereas in 
the l.d. and in magnitudes derived from it, like Cij = Uj (1, 0) the subscript i 
denotes the z-th gene. (See (46)).] Now denote by Si the sum of all prob¬ 
abilities Pi , by S 2 the sum of all pij , ^ . Besides, let P\,..,i{x) be the probability 

of exactly x successes in the first i trials (z = 1,2, • • • , ?z), and analogously, 
P 2 ...j{x) the probability of x successes in the j — 1 trials 2, 3, • • • , i, etc. Then 
the desired formulae follow’ easily: First we have obviously 

Ci.w = Vi (i = 1,2, ••• ,n). 

Because is the probability of one interchange betw’een the genes i and 
t + 1 i.e. of an interchange in the z-th interval, of “success in the z-th trial". 

For one-dimensional distributions Vi{x) the question is trivial because any set of m 
distributions Vi (x) can be considered as the marginal distributions of first order of V{xl , 
••• , arm) - Vi(xi) ••• Vfn(Xm). 
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Then Ci ,<+2 is the probability of one interchange between i and i + 2, i.e. of eith^ 
an interchange in the fin^t of the two intervals numbered i and i + 1 , and no 
interchange in the second; or of an interchange in the second but none in the 
first. Hence Ci,i ^2 * P,..• 4 . 1 ( 1 ), (t =» 1 , • • • , n — 1 ), because P,-,,-+i(l) is just the 
probability of exactly one ‘^success” in the two trials numbered i and i + 1. 
In the same way we get c ,-.<43 = P*....*^- 2 (l) + P......+ 2 ( 3 ), (t = 1 , • • * , n — 2), 

because an interchange between i and i + Z means either exactly one or exactly 
three interchanges in the three intermediate intervals. Hence we get altogether, 
with n = m — 1 : 

= pi 

(48) c,,,+2 = Pt, <4.2(1) 

Cim = Pi 2 ...n(l) + Pi2...n(3) + * * * Pi 2 ...n(n), whcrc ft = 71 if u odd, 

= n — 1 if n even. 

Let us also express the in terms of the S<. It is well known (see e.g. [3]) 
that 

(49) Pi,....»(x) = Z (X = 0, 1, • • • n). 

VmmX 

Applying these to (48) we easily find the convenient expressions: 

Ci2 = Pi , etc. 

Ci 3 = ipi + P2) — 2pi2 = (Si — 2^2)12, etc. 

Ci4 = (pi + P 2 + Pa) ~ 2 (pi 2 + Pl3 + P 28 ) + 4pi23 

(50) = (Si — 2S2 4" 4S3 )i 23 , etc. 
C15 = (Si — 2S2 4 “ 4S3 — 8S4)i.., 4 , etc. 


ci.m = Si — 2 S 2 + 4S8 4“ • • • 4“ (—2)’"Sw~i. 

10. The linear theory. Consider a linkage group of size m and assume for the 
moment that Cij 9 ^ Cik for all z, j, and k. It seems that the main mathematical 
content of the linear theory can be summarized as follows: It is possible to 
establish in a unique way an order or a succession of the genes, such that for the 

—11 c,p, the (m — l)(m — 2) ineqxiaUties 

^ij ^ (f = 1, 2, • • • , Wl 2) 

o o IX 

Cij < Ci^ij (t = 2, 3, • • • , m — 1) 

hold. In this succession j will be between i and k if Cit is greater than the two 
other c.p. c.j and cjk . The two arrangements 1, 2, • • • m and 7n,m-- 1, * * * » 
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• • • 1 are considered as corresponding to the same order. Furthermore, this order 
is a straightrline-succession for which an additive distance relation holds (cf. also 
[4a] and [13]). Instead of the restriction ca 9^ Cik it is sufficient to assume 
the weaker restriction, that in any triple Cij, Ca , Ckj one is greater than the two others. 
Without such a restriction uniqueness of the order no longer holds. E.g. in 
case of independent assortment where all equal ^ any of the m: possible num¬ 
berings of the genes is equally admissible from the point of view of the linear 
theory. In the case of complete linkage where all c.p. are zero it will be logical 
to consider all m genes as located in the same point. Obviously there are all 
kinds of intermediate cases. We shall come back to this point at the end of 
this section. 

Now consider again the case of 'different* c.p. (in the above defined sense). 
Let us prove that there can be not more than one succession for which (51) holds. 
In fact it follows from (51) that also: 


(510 Cij < Cik (i = 1, 2, • • • , m — 2) for all k > j 

and Cij < Ckj (f = 2, 3, • • • , m — 1) for all k < i. 


These are aU together il/ = 2-1 + 3-2 -h • • • + (m — l){m 


inequalities. 


On the other hand there are all together 



- = K?) 

M/2 “between’- 


relations for m numbers, each of them being defined by two inequalities as 
Cij < Cik and Cjk < Cik (if j is between i and ifc); hence on the whole M such in¬ 
equalities. But these are the same as (510, as we see by changing i, j, k into 
j, k, i in the second equation (510- Thus it is not possible to find two different 
successions which both satisfy (51). 

As to the metric of the problem, Morgan propased originally that the value of 
the c.p. Cij should be used as the distance between i and j. It has, however 
been objected repeatedly that this distance would not be additive; this is ob¬ 
vious since the triangular relation (13) holds for three subscripts (see also 
(50).^^ The equality c,j + Cjk = Cik holds only in the exceptional cases where 
multiple crossingover is excluded. It seems, however that an adequate defini¬ 
tion of distance is available if we try to formulate in terms of probability theory 
what the biologist had in mind. Let j ^ i. The distance di.j+i between i and 
j + I may be defined as the mathemaiical expectation of the number of crossingovers 
in (i, j + 1), i.e. in the j + I — i intervals between i and j 1. Hence if 


For a geometric equivalent of m points with m(m — l)/2 arbitrary distances we would 
have to turn to an (m — 1)-dimensional space. In fact it is well known that there are 
between k points in the plane only ^ 2k — 3 arbitrary distances, in space only St « 
3A; — 6, in r-space Sr * r/c — r (r -f- l)/2. Hence for r * m — 1 and k ^ m: <Sm-i “ 
m(m - l)/2. 
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Pt .y(x) denotes, as before, the probability of exactly x crossovers in t^ese 

(j + 1 — i) intermediate intervals the formula holds 

(52) =’Z’ 

Of course, an appropriate unit may be used such that in practical use the distance 
becomes proportional to the d,y introduced above. 

The mean value to the right in (52) is well known for any distribution 
v{xi , * • , Xn) whether an “independent” or a general distribution; (i.e. in our 
case: with or without “interference”). Denoting in the usual way by ir,(xi) 
the marginal distributions of first order of w{xi , • • • , Xn) and putting irt(l) = 
Pi = the probability of success in the i-th trial, we get: 

(53) di.y+i = Pi + + • * • + pj 

and in the same way with k > j 

dy+i,*+i = py+i + py42 + * • * + P* 
diMi = Pi + Phi + • • • + Pifc 
hence di,j^i + , or in general: 

(54) dij 4“ djk = dik (i ^ j k)^ 

It may be mentioned that the additive property of the mathematical expecta¬ 
tion which was used here is very well known (particularly for independent events) 
but not always correctly proved. The proof is contained in the transformation 
expressed in the following equalities: 

j+l-i 

di.j+i = Z xPi...i{x) 

x—0 

(55) ~ ^ ^ ^ ^ ^ ^ (Xt 4“ X,'-fi 4” * ’ * 4" Xy)iri,i^i,.. .y(x, ) * * • ) Xy) 

Xi Xi^i Xj 

= Z Z • • • Z (^* + Xi+1 + • • • + Xi)ir{Xl ,Xt,---, Xn). 

Xi Xi Xn 

(For general distributions Stieltjes integrals replace the sums.) In (55) 

t(xi , • • • , Xn) is the given n-variate distribution. Pi .y the probability of 

exactly x successes in the successive trials numbered i, • • • yj and 

Ti,i+i .y(xi, • • • Xy) IS the respective marginal distribution of ir(xi, • • • , Xn). 

The first equality in (55) is not obvious, while the second is rather trivial. From 
the second or third form of dij^i in (55), follows (53). The last expression in 
(55) shows that the expectation of any such sum as (x, 4- x,>i 4- • • * Xy) can be 
computed with respect to one and the same distribution t(xi , * • • , Xn). 
Therefore the distance di.y+i may also be defined as the expectation of {xi 4* xy+i 4“ 
• • • + Xy) mill respect to the c,d, ' 
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Because of the first equation (48) we get from (63) 

(530 • dii = + Ct+i.t+2 + • • • + . 

Hence the distance is equal to the sum of thej — i intermediate c,p. No difficulty 

arises for us from the obvious fact that always 

(63'0 Cij ^ d»y; and in general cyy < da, 

because the distance d^ is defined by (62), or (55) and not as co . 

On tiie right side in (530 stands the sum of certain c.p. We have repeatedly 
remarked that there may be hitherto unknown restrictions for a consistent sys¬ 
tem of c.p. Hence it is important to notice that there are no restrictions for the 
particular (m — 1) c.p. Cu, C 28 , ••• , . They can he quite arbitrarily 

chosen because of = pi . Hence any geometric representation of m genes 
arranged on a straight line in arbitrary distances (i = 1, 2, • • • m — 1) is 
surely consistent. E.g. m consecutive genes may be arranged with equal distances 
dn ^ d^ — • • • = dm-i.rn . Or some distances may be zero; then the respective 
genes are localized in the same point, etc. 

Finally, let us briefly consider the case of several linkage groups. According 
to §4 the l.d. then resolves into a product of several distributions; e.g. with 
m «= 12: 


i(€l€2 ) • * * > €12) a® /l(€i62€8€4)/2(€6€«e7)/s(€8€96io)/4(€iiei2) 

(56) = (i)Vi(| - t4|) 

ir*(| «» — «» 1 , I *6 — *7 1) • • • IT.d *11 — *1S I). 

Then, as postulated by Morgan, the linear order holds within each of the k groups^ 
whereas all c.p. among the groups are equal to 
Let us conclude this section by transforming the basic conditions (51) of the 
linear theory by means of (48). This will be needed in the follo>\ing section. 
Consider e.g. the condition Cia < Cu, i.e. 

(670 Pl 2 (l) < Pl 2 s(l) + Pi2s(3) 

or C 24 < Ci 4 3 nields P 28 (l) < Pi 28 (l) + Pm{S). Or in the same way: 

(57'0 Pi 2 s(l) + Pm(3) < Pi2a4(l) + Pi284(3) 

< Pi...., 5 ( 1 ) + Pi,..., 5 ( 3 ) + Pi. b( 5), etc. 

Thus we may express the content of (61) as follows: The probability that the 
“evcnf” happens an odd number of times in a set Ti of i consecutive trials is less 
than the probability that the event happens an odd number of times in the set Pt+i 
or in the set Ti^i each consisting 0 / t + 1 consecutive trials where Ti^i and T[^i 
denotes respectively the sum of Ti and either the immediately following or the imme¬ 
diately preceding trial. In this form we see again that the linear theory is an 
assumption, suggested by observations, and by no means logically necessary. 
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11. Some models of c.d.’8 based on the linear theoiy. The simplest ami 
very important example which has been suggested repeatedly is tiiat o( inde¬ 
pendent crossovers: 

(t) Independence. The crossovers do not influence each other, i.e. 

(68) Pa * P<Pi, Pnh = 

That this distribution is consistent is well known; hence only the specific ine-* 
qualities (48) or (57) have to be considered. Here the expressions Pw...<(ic), 
used in (67) become very simple, e.g. with p» + = 1: 

Pi...4(1) = piM«^4 + qiPiq^i + qtqiPzqA + ?iMiP4. 

Then a simple computation shows: 

Ci.y+i - Cii = (g< - p<) • • • (gi.1 - PM)Py 
Ci-i.i - Cii = Pi^iiqi - p<) • • • (g/-i - Pi-i)- 

These differences will be positive if all qi — p< > 0 or all p, < J. Hence; A 
consistent c.d. which fulfils the conditions (61) of the linear theory is the distribution 
of ^^independent crossovers'^^ with basic probabilities pi = c»,,+i (t =« 

1 , 2 , • • • m — 1 ), with the one restriction 

(60) Ci,i+i - PiS h 

The distribution is completely determined by (68). If all pi ^ p = J, we have 
the particular case of free assortment. 

Although this independence is more general than MendePs original assump¬ 
tion, Morgan, Haldane and others reported observations, not in accordance 
with this hypothesis. One crossingover seems to prevent others in a certain 
neighborhood*’. This phenomenon was named interference. It suggests that 
we have to consider the c.d. as a distribution of dependent rattier than of in¬ 
dependent events. This will be done in the following pages. First conender 
the limit-case of: 

{ii) Complete interference or disjoint events. In this case we have 

(61) PiS = Pijk • • • = Pl2,...,m-1 = 0. 

Thence it follows that we have simply 

= Pi 

(62) C,-.,+2 = Pi + P4+1 

c*.»+s =* Pi + Pi+l + Pt-f 2 , etc. 

In this particular case the c.p. are additive c<y = dij. It is obvious from (62) 
that in this case the conditions (61) o/ the linear theory are fulfilled. On the other 
hand it follows from (49), (for x = 0) and (61) that the system is consistent if 
and only if 

(63) 


s Pi + ?>2 + • • • + p, ^ 1 


(n = m — 1). 
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This is in accordance with the fact that nearly or exactly additive c.p. have been 
observed always in connection with very small prvalues. 

The most striking observation leading to the concept of interference was that 
Vu ^ VxVi »i-e. that double crossovers appeared less frequently than one would 
have assumed for independent crossovers, but that nevertheless they did appear 
sometimes. A particularly simple model of dependence or interference which 
starts with this fact, preserving however, the main structure of independence, is 
the following: 

(m) One-parametric model of partial interference. Assume as before (m — 1) 
basic probabilities p, and put 

(64) pij — epiPj , Vijk ~ epipjpk > * * ’ > etc. (0 ^ ^ 1). 

There is independence if € = 1, complete interference for 6 = 0 and partial 
interference for intermediate values of 6. Let us first investigate conditions 
for the consistency of this distribution. Necessary and sufficient conditions 
for a consistent, distribution of arbitrarily linked events are well known (see 
e.g. [3] (b) p. 239). Write m — 1 = n. A system of pi , p,y, • • • , pi...n is con¬ 
sistent if it is possible to compute from these (2"" — 1) values, 2" non-negative 
values • »7n) (vi = 0 or 1) which have the suiri one and are given by 

the formulae: 

7r(ll • • -1) = Pl2 ...n 

7r(ll**‘10) = Pl2. -.,(n-l) Pl2, -.,n 

(65) 7r(110* • -0) = Pl 2 — X) pl 2 ni + XI 2 Pl 2 n.n 2 ± Pl2 • n 

(ni) ni n >2 


7 r( 00 * * -0) = 1 — P,H 4" £ X3 Vnin 2 — * * * ± Pu-.-n • 

w 1 «1 2 

Because of the symmetry of (64) it will be sufficient to check (65) by means of 
the relations (49) which can be obtained from (65) by collecting groups of 
equations such that the corresponding w(rji , • • * , »?n) show all the same number 
of Ts as arguments. Write Pi,..nix) = Px for the probability of x successes 
in the n trials where independent events with basic probabilities p, are con¬ 
sidered, and Px for the analogous probability corresponding to the distribution 
(64) and introduce in the same way Si and where as before, S 2 = 2p,y, 
Sz = ^Pijk , etc. We then find: 

Po = 1 — aSi -f" €^2 — 6/83 -[“••• 

( 66 ) 

= Po + (1 — €)(— >82 + <83 — AS 4 + • • •) = Po€ + (1 — a8i)(1 — e). 

It follows that the expression Poe + (1 — a 8 i )(1 — c) must be ^ 0 . For 6 = 0 
this condition reduces to (63), whereas for e = 1, there is no restriction at all. 
On the other hand this is the only restriction of this kind, because we find 

P[ = .8'i 252 + 35 ; ~ 45 ; + • • • = 5i ~ 2652 + 3653 ~ 4654 +••• 

= Pi + (1 - 6)(252 - 358 + 454 • • •) = Pie + a8i(1 - 6). 
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This last expression is always gO because of Pi ^ 0, iSi ^ 0, e ^ 1. Further¬ 
more we find for t ^ 2 that P< == Pa, hence always non negative. Therefore 
our system is consistent under the one condition (66). 

The additional restrictions corresponding to the linear theory have still to be 
considered. A simple computation yields the result 

(67) ~ 2«p,)(l - 2 €P<+i) •••(!- 2tpi-i)pi 

Ci.,+1 - C<+i.y+i = Pi(l - 2*Pi+i) •••(!- 2«py). 

These differences are ^0 if p< ^ ~ which is, for e < 1, less strong tlian (60). 

Hence we sum up: A consisimi model of partial interference with one parameter t 
to fit the observations can be obtained on the basis of n — m — \ prob¬ 
abilities Pi, P 2 i • • * Pn by means of (64), if the condition 

(68) Poe + (1 - -S,)(l - *) ^ 0 or S, gl + j-^^Po 
holds and the additional restriction required by the "'linear theory^^ 

(09) p. S i 

is satisfied. For € = 1 this reduces to ^'independent cvents^^ or "no interference^^ 
with no restrictimi (68), and (69) reducing to (60). For € = 0 our model yields 
"complete interference^^ or "disjoint events^^ with restriction (68) becoming (63) 
and no restriction (69). If we say that this model contains one parameter only, 
the idea is that the pi are to be identified with the basic c.p. c^.t+i. It might, 
however, seem adequate to consider e and Pi, • * • , Pm~i as m available param¬ 
eters which may be determined from the observations by some appropriate 
method. 

(iv) An (m — I)-paramc^r^c model of partial interference. Numerical data 
show (see particularly [4]) that interference is particularly marked i.e. pij < pgpj , 
if the corresponding p,, pj are very small, whereas for greater values of the Pi 
we have more nearly the pattern of independence. This is rather a striking 
fact, and seems to be well confirmed by observation. In these final pages a 
model will be studied which takes into account the circumstance that the amount 
of interference seems to depend on the magnitudes of the p,. It contains 
(m — 1) parameters, is therefore rather flexible, but nevertheless very simple. 

Assume m — 1 = n numbers €, where 0 ^ ^ 1 and form by means of n 

probabilities p,- ; 

(70) upi = p, (0 g €.• g 1) (t = 1, 2, ... , n). 

We may choose u small if the corresponding p» is small and larger if it is large; 
if the p’s are all of the same order of magnitude the €’s need not differ much 
either. Then we simply define: 

(71) Po = pipy , p.yjfc = piPipk , • • • , Pl2...n = plpIt * * * pn . 
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Let US investigate the consistency of this model. In analogy to (66) we form 
with Si « 2pi, Si == 2^,-, S 2 = ^pipj, etc.: 

Po 1 Si + Sj — Ss + • • • 

(72) 

= (1 — Si + S 2 — S 3 + • * •) ““ (1 «*)p» ~ Po — X) (1 ^i)pi 

«-l i-1 


where Pq and Po are the probabilities for zero successes for the model under 
consideration and for independent events with basic probabilities pi respectively; 

n 

hence Po = IT (1 — api) and we get the condition: 


(73) n (1 — tiPi) ^ £ (1 — €i)pi 


or: S P.- ^ pi + li (1 - pi)- 

i-1 t-l «-l 


If all €i = 1 there is no restriction (73), while for €» = 0 we find again (63). The 
consideration of Pi, P 2 , * * * yields no new condition, because we get, denoting 
by P,• the probability of i successes for the independent events with basic prob¬ 
abilities pi : 

Pi — <Si — 2 S 2 + 3 S 3 — • • • db nSn = Si — 2 S 2 + • • • zfc nSn + X) Pt(l €|) 

t-1 

n 

= Pi + XI P*(f ““ €*) ^ 0 and: 

«-i 

P< = Pi ^ 0 (i ^ 2). 


As for the restrictions imposed by the linear theory we find: 

{^A\ = (1 — 2pi)(l — 2pi^i) ... (1 — 2pj^i)pj + py(l — €j) 

(74) 

' Ci,j+l - Ci+i.y+i = pi{l - 2pi+i) • • • (1 - 2pj) + pi{i - €<). 

Thus the conditions of the linear theory are satisfied if 

(75) pi ^ I or p. g A . 

Hence summarizing: On the basis of m — 1 prohabililies pi a consistent model of 
partied interference is Obtained by means of (70) and (71) if the condition of con¬ 
sistency (73) and the conditions (75) are satisfied. 

It may be that the four simple models described in this section will seem too 
crude for the description of the complex mechanism of linkage. They could, 
of course, be combined and modified in various ways in order to serve at least 
as an approximation to the theoretical picture of reality we wish to construct. 
But, while these particular attempts may be inadequate, it seems to the author 
that the underlying principle is not wrong: that a mathematical theory of 
linkage must finally consist in statements on the l.d. (or the equivalent c.d.). 
Tlie consideration of the c.p. is not sufiBcient for this purpose. The mathe¬ 
matical instrument for a theory of linkage seems to be the probability theory of 
the linkage distribution. 
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THE COVARIANCE MATRIX OF RUNS UP AND DOWN 

By H. Levene and J. Wolpowitz 
Columbia University 

1. Introduction. Let Oi, • • • , be n unequal numbers and let the sequence 
/S = (Ai, // 2 , * ‘ , hn) be any permutation of ai, * • • , On . /S is to be considered 
a chance variable, and each of the n\ permutations of ai, • * • , On is assigned the 
same probability. Consider the sequence R whose i*^ element is the sign (+ or 
—) of hi^i ~ , (t = 1, 2, • • , n ~ 1). A sequence of p consecutive + signs 
not immediately precjeded or followed by a + sign is called a run up of length 
p; a sequence of p consecutive — signs not immediately preceded or followed by 
a — sign is called a run down of length p. The term “run** will denote both 
runs up and runs down. The usage of the term ‘‘length’^ varies; most quality 
control literature attributes the length p + 1 to the runs which we say are of 
length p. 

As an example of our usage, the sequence 

S = 2 8 13 1 3 4 7 

gives the sequence 

= + + - + + +, 

which has a run up of length 2, followed by a run down of length 1, followed by 
a run up of length 3. 

Runs up and down are widely used in quality control and have been applied 
to economic time series. The purpose of this paper is to obtain their variances 
and covariances and to correct some erroneous notions prevalent in the literature 
about their application. 


2. Notation. If the sign (+ or —) of — hi) is the initial sign of a run 
defined as above, we call hi the initial turning point (i. t. p.) of the run. Then 
h\ is always an i. t. p., and we adopt the convention that hn is never an i. t. p. 
We define new stochastic variables as follows: 


( 2 . 1 ) 

(2.2) Xp.- = 

(2.3) 

for t = 1, 2, • • • , n. 


__ f 1 if hi Ib an i, t. p., 

* ~ \o otherwise, 

f 1 if hi is the i. t. p. of a run of length p, 

\0 otherwise, 

if hi is the i. t. p. of a run of length p or more, 
otherwise. 


Also 


(2.4) r = the number of runs in /?, 

(2.5) Tp = the number of runs of length p in /?, 

(2.6) Tp = the number of runs of length p or more in R. 

n n n 

Evidently r = £*.•.»■?= ^ , and r, = . 

t-1 t-1 
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If X and Y are stochastic variables, let E(X) denote the mean of X, a(XY) 
denote the covariance of X and F, and o^(X) denote the variance of X, if they 
exist. By the distribution functicm of X we shall mean a function/(a?) such that 
P{X < a:} s /(x), where the symbol P[ } denotes the probability of the re¬ 
lation in the brackets. 


3. Preliminary formulas. Let F' be a stochastic variable with any continuous 
distribution function f{y). Let F = (yi, yj • • * ^ ^n) be a sequence of n inde¬ 
pendent observations on F'. Since P\yi = y^] = 0, (t ^ j; i,j == 1, 2, • • * , n), 
the distribution of runs up and down in F is evidently the same as that in S. 
Now choose fiy) to be 

m - 0, (y< 0), 

fiy) = 2/, i0<y< 1), 

fiy) = 1, iy > 1). 

Then 

P{yi-i < Vi > y,+il = ^ ^jf dyi-^ dy^dyi+i = 

By symmetry 

E{xi) = P{yi-i < yi > y<+i) + P\yi-i > yt < j/.+xl = f, 

(i = 2, 3, • • • , n — 1). 

Also E(xi) — 1, and E(xn) = 0. 

It will be necessary hereafter to evaluate expressions of the types 


(3.1) 
and 

(3.2) 



{k + p)!’ 





dy 


P * 


From the fact that 


-1 y.1 pVj,+ l 1.1 

J ■■• dyi---dyp= ■■■ 

^Vp + 1 •^V2 •'0 •'0 


T2 # 

dvi 


dvp = 

p! 


where v, = (1 — py), 0 = 1> • • • > P + 1), it can easily be shown that 
F = £ (-1)'+* - + (-1)" 


(3.3) 


,_i (fc + a) !(p — s)! 

(p + k-r)lrl- 


(* + p)! 


/•I ^Vp + 2 

We shall also need J V dyp^i and J J V dyp+i dyp+i . Now 


(3.4) 
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Making use of the relation, 

tc-ir—^ 

r^O 

we have 


= (-1)' 


(3.5) 

Similarly 

« a 


(n —r)lr! ' n(» —<—!)!<!’ 


(t < n), 


1 


ifp+t 


V dvp+i dyp+t = 


(p + fc + l)p!*!r. 


1 


1 


(p + A: +l)p!4:! (p + A + 2)(p + 1)! fcl ‘ 


4. Covariances of runs up and down. We first compute E{rp) and E{rp). 
We define the symbol 

P{-, } = P{yi-i >yi< yi-n < ■ ■ ■ < yt+p > 2/<+p+i). 

The value of the right member is independent of i whenever it is defined {i.e. 
t — 1 > 1, i + p + 1 < n). Now 

E{xpi) = P{-, +^ -} + P{+, +1 = 2P{-, +^ -1 

rvi^p f' p* + 3p+l 


-»/7' /’ 

•'0 Jti+p+i Jo 


(P + 3)! 

(t =2, 3, •••,n - p - 1). 

E(xpi) = 2P|+^ -} and E(xp.n-p) = 2P{-, +”}. 

7) 1 

By symmetry E(xpi) = E{xp,n-p), the common value being 2 , , ,. Also 

(p + 2)! 

E(,Xpi) = 0, (t > n - p). 

Thus 

P(rp) = E (Z Xp^ = 2E{xpi) + (n - p - 2)^(xp,) 


(4.1) 


= 2n P* + ^ _ 2 P* + 3p^ - p - 4 


(p < n — 2). 


(p + 3)! (p + 3)! 

Besson [1], Kermack and McKendrick [5], and Wallis and Moore [6] gave the 
exact value, although Besson proved it only for special cases. R. A. Fisher [3] 

E(rp) 

**™m ■ 

It is clear that E(wp{) = E{Xp,n-p), (t = 2, ■ • • , n — p), while E(wpi) = 
2P{+’’) = 2/(p — 1)1. We then have » 



RtTKS XTP AND DOWN 

Setting p bb 1 we have 

(4.3) Eir[) = E(r) - i(2n - 1). 

Fonnula (4.3) was given by Bienaym4 [2J. 

We now obtain <r(r,r,). Let (Xp.- — E(xpi)) = «,<. Then 




(4.4) 


v(rpr,) 




= X) E(ZpiZ^) + £ 2 El{z^z„) + 23 ]C El{zpiZ^i). 
• • <</ 


i<i 


For j > t + g + 3, x,< and Xpj are independent and hence F(a,<Zpy) » 0. Omit¬ 
ting zero terms from (4.4) we have 


(4.5) 


= ES E{x^x„)+ ZE E{x^x^i) 

% i<i< ff4** »*<y < 

- [1LE{x^)E(x^) + EE E{x^)Eix^) 

i »<;■<*+«+» 

+ EE E{x^)E{x,i)\\- 

»<y<»-*-p+8 


Since Xpyx,,- = 5p«(Xp<)* =» Sp«Xp.-, we have for the first term ofihe right member 
of (4.6) 


(4.6) 


E{XpiXfi) — ^pfE^Tp), 


where the Kronecker delta 8p, 


^ /l, if p = g, 

0, otherwise. 

Since x,iXpj = 0 for i < j < i g, the second term in the right member of 
(4.5) is 

n—jg—p—y—2 

l-l $-1 *-l 

= {(n - p — g — 2)E{x^Xp,i+t) + (n — p — g — 3)£(x^Xp,»+.«.i) 

+ (n - p - g - 4)F(x,pXp,<+*+*) 

-H F(x,iXp.,H-i) + E(x^iXp,t+t) + Eix^iXp.^) 

"i" E^^q,n~q—.pXp,ii —p) 1 -p—1 Xp,n—p) 1 E^Xq^n-^q-^p —J^p.n—p)}* 

Now F(x,jip.<+j). = 2P{—, -|-®, +} = 2A, where 

n V»+p + « + l /•! /•! r fVi+fl rVi+l Pl “I 

/ •••/ 1 “i J 

• •%<+p+«+i • 


(4.7) 
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The expression within the square brackets is easily evaluated, and applying 
(3.6) to the result, we have 


ip + Q + {p + q + 2)(p + l)!g! 

{p + q + 2)p\(q + 1)1 ^ (p + g + 3)(p + l)!(g + I)!* 

Similarly, EixgiXp,i^g+i) = 2P{-, +^ -1, and E{xqiXp,i^g^ 2 ) = 

2P{-, +\ -, -, -} + 2P{-, +^ +, +}. The other terms 

in the right member of (4.7) are obtained in like manner. The right niember of 
(4.7) is symmetric in p and q; hence the second and third terms of the right mem¬ 
ber of (4.5) are equal. 

We now consider the remaining terms in the right member of (4.5) for p > q; 
the result obtained also holds for p < g. We write them as 


Ej{xg;^E{Xp^i~-(p-\-2)) *4" ■^(^5»)'^(^p.»—fp-f 1)) 

p+8 i—p+2 

n--^ n —q— l 

4" ‘ ' 4" Ej{Xg^E{Xp,i—{p—g)^ 4“ P(^9t)^(^p.»—(p—tf—1)) 


(4.8) 






4" • ’ * 4- 22 E{Xgi)E{xpi) 4" 22 J^{^qi)E{xp,i+\) 

t-i t-i 

n—p—C^+2) 

+-f- 2 E(x^)E(xp ,»+Cff'f2)) ^ • 

<—1 


The (p + (? + 5) sums in (4.8) comprise in all <(n — p)(p + g + 5) — 2 ^ 
teras. Remembering that E(xp,n-p) = E{xpi), (4.8) becomes 

- |[n(p + g + 5) - (p* + pg + g“ + 7p + 7g + 16)]£:(x„)£(a:,j) 

(4.9) 

+ [2p + 4\E{xgi)E{xpi) + [2g + 4:]EiXgi)E{xpj) + 2E{xgi)E(x,i)]. 

Adding the right member of (4.6), twice the right member of (4.7), and (4.9), 
we have 


ffirpTg) = 

p°(p + g + 6)(g* + 3g + 1) 

2« “I _o + P(3g’ + 20g* + 40g + 19) + <g’ + 9g* + 29g + 26) 

' ' (g + 3)l(p + 3)! 

lo —p + g+1 _2 1 

(p + g + 3)(g + 2)!(p + l)! ,(p + g + 5)(g + 3)r(39 + l)! 

_ O (p + 9)' + 9(p + g)* + 23(p + g) + 14 
(P + g + 5)1 

12 1 1 ; p1±3p+_1 

^^(p + g + Dglp!^®”*' (p + 3)! 




(4.10) 
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p\^ + 33 + 1) + p’(3* + 93* +193 + 6) 

I o I + p*(3* + 93* + 283* + 353 + il) 

2« _ 

(3 + 3)1 (p + 3)1 + p( 3 g,« + 203 * + 4 O 3 * + 293 + 10) 

1_ + ( 3 ^ + 93 * + 273 ^ + 323 + 10) 

I 2 (p + 3 + 2)(p — 3 — 1 ) . 2 _ p + 3 + 4 _ 

(P + 3 + 3)(3 + 2)!(p + 1)!^ (p + 3 + 5)(3 + 3)!(p + 1)! 

. 2(P + 9)‘+ 10(p + 3)‘ + 29(p + 3)*+16(p + 3) “ 19 

(P + 3 + 5)! 

o P + g _. p* + 3p* - p - 4 

(p + 3 + l)3!p! (p + 3)l 


where 8pq is the Kronecker delta. Formula (4.10) is valid for p + g < n — 4. 
It is symmetric in p and q. Setting p = qwe obtain 

,2p‘^ + lV + 41p* + 55p^ + 48p + 26 
- 2„|- 2-_____- 

^ 2p* + 9p + 12 4p* + 18p* + 23p + 7 

(2p + 3)(2p + 5)(p + 3)1 (p + 1)1 (2p + 5)1 

■ 2_ -1 _ I P' + 3p + 1 

^ (2p+l)p!p!^ (P + 3)! 

.. 1 ^ o/o 3p‘ + 24p‘ + 69p* + 90p* + 67p* + 42p + 10 

(4.11) + 2^2 — - 

2p* + lip* + 19p + 9 
^ (2p + 3)(2p + 5)(p + 3)!(p + 1)1 

2 19p‘ + 80p* + 116p* + 32p - 19 
^ (2p + 5)1 

_ 4__ p* + 3p* - p "^ 

(2p + l)p!p! (p + 3)! 

We next evaluate o'(rprg). Since Wqi and Wpj are independent for j >i + q+2, 
we have, corresponding to (4.5), 

(r(rpr') = 23 E(WpiWqi) + 2 £ E{WqiWpi) + 2 2 E{WpiWqi) 

i «</<»+p4-2 

(4 12) - E E{w^)E{w^) + 2^ 2^ E{w^)E(,w,i) 

+ 2:e ^(«>w)i?(w„)i. 

«;<<+!>+* 
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Let G » Max (p, q). Then Wp^Wfi m wai and we have for the first term of the 
right member of (4.12) 


(4.13) E EiWpiWpi) = E{r'a). 

The second term in the right member of (4.12) may be written 


(4.14) 


(n - p - q - + (n - p - g - 2)E(Wg/Wp,i^g^i) 

+ E(WglWp,g+l) + E(WglWp,g^2). 


T!iow E(wgtWp,i+g) = 2P{-, +\ -’'ly E(WgiWp,i+g+i) = 2P{-, +^ +^} + 

2P{ —, —''1, and the other terms are obtained similarly. The third term 

in the right member of (4.12) will be equal to (4.14) with p and q interchanged. 
The remaining terms in the right member of (4.12) become 


- |[n(p + g + 3) - (p^ + pg + + 4p + 4g + 5)]E(wgi)E(wpj) 

1.15) 

+ [p + ^]E{wgi)E(wpi) + [g + l]E{wgi)E{Wpj) + E(wqi)E(Wpi)}, 
We can now write the formula for <r(rpr[)y valid for p + g < n — 2, 

fj J\ ^ I + 2) + p(2g“ + 8g + 5) + (2g + l)(g + 2) 

-(j + 2)!(p + 2)l- 


. 2 _ ( g + l)(g + 2) + (p + l)(p + 2 ) 

(p + ? + l)g!p! (p + g + '3)(g + 2)!(p + 2)!' 

(4.16) 

_o P + g + 2 ,(G + 1) 

(p + g + 3)1 (G + 2)!j 

^ /p’(2g + 2) + p‘(2q‘ + 8g + 5) + p(2g* + 8g* + 6g - 2) + (2g’ + 5q‘ -2q-6) 

I (g + 2)!(p + 2)! 


o p + g i(p + g + 2)[(p + i)(p + 2) + (g + i)(g + 2)] 

^ (p + g'+ l)g!p! ' (p + g + 3Kg + 2)!(p + 2)1 

I 9 (P + g)* + 3(p + g) + l_(?® + G — ll 
(p + g+ 3)! (G + 2)! 

where G — MaxyCp, g). Setting p = g we obtain 


<r*(r;) = 2n 



(p + l)(2p* + 4p + 1) 
(p + 2)I(p + 2)! 


+ 2 


(2p + l)p!p!, 


_ 2_ I _4 P + ^ + PjlAl 

'^(2p + 3)(p +2)!p! (2p + 3)l^(p+2)!/ 

+ 2/2 (P + l)'(3p* + 4p - 3) _ P 

(p + 2)l(p + 2)! (2p + l)p!pl 


. p +»1 , „ 4p’ + 6p + 1 _ p* + p - 1 

(2p + 3)(p + 2)lp! (2p + 3)! (p + 2)1 
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Setting p 
(4.18) 


1 , we have 


<r\ri) = ff*(r) = 


i 6 n - 29 


90 


The value of <T®(r) was given by Bienaym 6 [2]. 
Finally, we evaluate 


ciTpVg) ^iXpiWqi) “b E(Wgi Xp/) “b EI(^Xpi'Wqj^ 


(4.19) 


- lE^(*pi)£(w,<) + EE fi(«>^)£^(*«) 

< «;<<+g +2 

+ EE EME(w,d]. 

«/<<+P+8 


f 1 i ^ i 

Jjei the symbol i?,y = 7 . 


(4.20) 


Then Xpi Wqi s , and 

5 ^ E(XpiWq^ = fJpq[E(Tp)], 


The remaining terms of (4.19) introduce no new difficulties, and for p + g < 
w — 3 we obtain 

f p^(2q + 2) + p‘i2^ + 13? + 12) 

+ p( 6 g* + 22? + 23) + (2?" + 6 ? + 15) 


c{rpr^) = 2 n 


+ 


(p + 3)!(? + 2)! 
p - g 


(p + ? + l)p!?! (p + ? + 2 )(p + 1 )!(? + 1 )! 


+ / _ -r 


(p - ? + 1 )(? + 2 ) 


,+ 


(p + 2)(p + 3) + (? + 1)(? + 2) 


(p + ? + 3)(p + 2)I(? + 2)! ' (p + ? + 4)(p + 3)!(? + 2)! 


- 2 


(p + ?)* + 5(p + ?) + 5 


(p + ? + 4)! 


+ Vpq 


p^ + 3p+l 
'■(p+3)! ' 


(4.21) 


+ 2 


y(2? + 2 ) + p\2q^ + 13? + 12)' 
+ p*(2?’’ + 13?* + 26? + 24) 


- 2 


(p + 3)!(? + 2)! + p(eg,« 4 . 22 ?* + 19? + 27) 
L+ ( 2 ?* + 6 ?* + 10 ? + 25) 

P + ? _ p* + 2p - ?* + 2 


(p + ? + Dpi?! (p + ? + 2)(p + 1)!(? + 1)! 
(p + 2)[(p + 2)(? + 3) - 1] - ?(? + 1)(? + 2) 

(P + ? + 3)(p + 2)!(? + 2)!' 

(p + ? + 3)[(p + 2)(p + 3) + (? + 1)(? + 2)] 


(P + ? + 4)(p + 3)!(? + 2)1 

, o (P + 9)‘ + 6 (p + ?)* + 8 (p + ?) - 1 _ 

■ ^(p + ? + 4)! ■ 

where is dehned as in (4.20). 


P* + 3p* 


p — 4 


(P + 3)1 
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6. Hie use of runs up and down in tests of significance. Certain miscon¬ 
ceptions about the application of runs up and down have appeared in the litera¬ 
ture, and it is the purpose of this section to clarify them. 

Since Eixp), «r*(rp), E(r) and «r*(r) are all of tiie order n, it follows that rp/r 
converges stochastically to 


Let 


\p = lim 

n-»oo 


E(rp) 

m * 



Ejr'p) 
E(r) • 


From (4.1) and (4.2) we have 

M 

X» 

Xi 


5 

8 

n 

40 

ii 

240 

29 

1680 

Jl_ 

280 


= .6250 
= .2750 
= .07917 
= .01726 
= .00357 


Let 


^ E(rp) 

E(r) 


Some writers say that \pn or X, is “the probability of a run of length p.” If 
the stochastic process consists in obtaining a sequence from among the n! se¬ 
quences S, each of which has the probability (re!)“*, then the phrase “the prob¬ 
ability of a run of length p” has no meaning. One can speak of the probability 
of at least one run of length p (i.ei, that Vp > 0), of the probability of no run of 
length p (rp = 0), of the probability that the first or fifth run (if there are five 
runs) in the sequence 5 be of length p, etc. It is possible to give different sto¬ 
chastic processes in which “the probability of a nin of length p” will have 
meaning and be Xp„, or Xp. Consider, for example, the totality of all the runs 
in the n! sequences S. There are nlE(r) of them, and among these there are 
n!E(rp) runs of length p. Now let the stochastic process consist in drawing a 
run from the totality of all these runs, each of which is to have the same prob¬ 
ability, which is therefore [n!F(r)]'‘. Then the probability of drawing a run of 
length p is Xp« . It is difficult to see how this stochastic process can have rele- 
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vance to most of the problems of quality cmitrol and ccqnmnie time series where 
runs up and down are now used. 

Some writers on quality control and economic time series recommend that 
statistical control or randomness be tested by use of di, • • • , d^i, d',, where 

d. = r< - E(ri), (i =1,2, , (p - 1)), 

dp = r'p - E{rp). 

The availabihty of the covariance matrix M oidi, • ■ • , dp_i, d'p , which we have 
obtained in this paper, will assist in the construction of such tests. Also of help 
will be a result recently announced by one of us |7], the early publication 6f which 
is expected. This result states that in the limit with n the joint probability 
density function of di, • • •, dp-t, d'p, is where A is a constant and Q 

is a quadratic form in di, • • • , dp_i, d'p whose matrix is the inverse of the matrix 
M. It follows immediately that Q has in the limit the x* distribution with p 
degrees of freedom. 

We wish now to make a few remarks about the tests of significance, based on 
runs up and down, which are used by some contemporary writers. A descrip¬ 
tion of their method can perhaps be best given by an example. With n = 100 
and p = 5, say, suppose the observed values are: 

Observed Values 
r, = 30 
r, = 10 
rs = 4 
r4 = 3 
ft = 3 
Total, r = 50 

These writers then say that the expected values are: 

Expected Values according to some writers 
Ein) = rXi = 50 (.6250) = 31.25 

Ein) = rX* = 50 (.2750) = 13.75 

E{n) = rXs = 50 (.07917) = 3.96 

JS:(r4) = rX4 = 50 (.01726) = 0.86 

Eir'i) = rXj = 50 (.00357) = 0.18 

50.00 

The coirect expected values are given by (4.1) and (4.2) and are: 

Correct Expected Values 
E{ri) = 41.75 

E{Ti) = 18.10 

Eixt) = 5.15 

E{rt) = 1.11 

E(x'i) = 0.22 

66.33 
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m 

It should be noted that: 

(a) A ccmsequence of the incorrect method of obtaining '^expected values^' 
is that, since 

E(r) « E(xi) + E(r 2 ) + J?(ri) + E^u) + Eiu), 

it implies that the expected number of runs of all lengths is equal to the deserved 
number! This is obviously erroneous. In fact it follows from (4.18) and the 
results announced in [7] that r — E{r) is in the probability sense of order -\/w. 

(b) By using the incorrect expected values for comparison with the observed 
values one loses the valuable information furnished by r — E{r). If this is large" 
(in terms of its standard deviation) it is plausible to question whether statistical 
control or randomness exists. 

6. Summary. Let S = (hi , • • • , An) be a random permutation of the n 
unequal numbers ai, • • • , On, and let R be the sequence of signs (+ or —) 
of the differences — hi (z = 1, • • • , n — 1). It is assumed that each of the 
nl sequences S is equally probable. A sequence of p successive + (—) signs 
not immediately preceded or followed by a + (—) sign is called a run up (down) 
of length p. Let and be the number of runs up and down in R of lengths p 
and p or more respectively. In this paper the exact values of (see 

formula (4.10)); <r*(rp), (formula (4.11)); c{rpr[), (formula (4.16)); <r*(rp), (for¬ 
mula (4.17)); and cirpVf,)^ (formula (4.21)) are derived. A few numerical 
values are: 


An) 


2/ \ 305n —• 347 

, »v;) - 

19n + 11 


16» - 29 


90 

<r(rirs) = — 

41n — 99 


Ar.) = 

67n - 43 
720 ’ 


51106n - 73859 
463600 

2 , /« 21496n - 51269 

. 453^---’ 


630 ’ 
airiTi) = — 


210 ’ 
ffCrs/,) = 

363» - 817 
~ 5040 ■ ’ 


/ / /v 5w 3 

<r(rir2) = 


60 


23n + 135 


1260 ’ 
and virtr'i) = — 


<r(nri) = 




117n - 79 


720 

18346n - 49019 
453^ 


The values of Eivp), (formula (4.1)); and E{rp), (formula (4.2)) are also 
given. Ctertain misconceptions about the applications of runs up and down 
are discussed. 
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ON THE MEASURE OF A RANDOM SET 


By H. E. Robbins 

Post Graduate School, U. S, Naval Academy 

1* Introduction. The following is perhaps the simplest non-trivial example 
of the type of problem to be considered in this paper. On the real number axis 
let N points Xi {i — 1, 2, • • • , N) be chosen independently and by the same 
random process, so that the probability that Xi shall lie to the left of any point 
a? is a given function of x, 

(1) e{x) = Pr (xi < x). 

With the points x, as centers, N unit intervals are drawn. Let X denote the 
set-theoretical sum of the N intervals, and let yiiX) denote the linear measure 
of X, Then /i(X) will be a chance variable whose values may range from 1 
to N, and whose probability distribution is completely determined by <t{x). 
Let r(u) denote the probability that ti{X) be less than u. Then by definition, 
the expected value of tiiX) is 

(2) EiniX)) = j\dT(u), 
where 

(3) t(u) = Pr iix(X) < u). 

The problem is to transform the expression for E(n(X)) so that its value may be 
computed in terms of the given function aix). 

In order to do this, we observe that, since the x, are independent, 

(4) t{u) = /•••/ da{xi) • • • d<T{Xff)y 

C(u) 

where the domain of integration C(u) consists of all points (xi, • • • , Xn) in 
Euclidean AT-dimensional space such that the linear measure of the set-theoretical 
sum of N unit intervals with centers at the points x, is less than u. Here, how¬ 
ever, a difficulty arises. Due to the possible overlapping of the intervals, the 
geometrical description of the domain C{u) is such as to make the explicit evalua¬ 
tion of the integral (4) a complicated matter. 

The difficulty is even more serious in the analogous problem where instead of 
N unit intervals on the line we have N unit circles in the plane, with a given 
probability distribution for their centers (x,, y,). Again we seek the expected 
value of the measure of the set-theoretical sum of the N circles. The correspond¬ 
ing domain C{u) in 2Ar-dimensional space will now be very complicated. 

It is the object of this paper to show how, in such cases as these, the expected 
value of ^(X) may be found without first finding the distribution function r(u). 

70 
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In fact, the theorem to be stated in (15) will in many important cases yield a 
comparatively simple formula for E(jt(X)). 


2. Expected value of In order to state the problem in full generality, 

let us suppose that X is a random Lebesgue measurable subset of Euclidean n 
dimensional space E„ . By this we shall mean that in the space T of all possible 
values of X there is defined a probability measure p(X) so that for every ^ 
measurable subset S of T, the probability that X shall belong to S is given by 
the I^ebesgue-Stieltjes integral 

(5) Pr{X eS) =[ Cs(X)dp(X), 

Jt 


where the integrand is the characteristic function of aS, 


( 6 ) 


Cs(X) - 


1 

0 


for 

for 


XeS 

XiS. 


In practice, the set X will be a function of a finite number of real parameters 
(e.g., the coordinates of the centers of the intervals or circles considered in the 
Introduction), X = X(ai, • • • , ar) = X(a). There will be given a probability 
measure v{a) in the parameter space Er , so that a will be a vector random 
variable in the ordinary sense. If A is any r-measurable subset of Er , then by 
definition, 

(7) Pr(a€^) = f Ca{oi) dv{a), 

JEr 


Now for the set S' consisting of all X such that X = X(a) for a in il, we define 
p(S') = y(A). Thus a p-measure is defined in the space T of X, which is the 
general situation considered in the preceding paragraph. 

Returning to the general case described in the first paragraph of this section, 
we shall now prove the main theorem of this paper. To this end we define, for 
every point x of En and every set X of T, the function 


( 8 ) 


X) 


|l for a; c X 
|o for X 4 X. 


Moreover, for every x in /?n we let S(x) denote the set of all X in T which contain 
X. Then for every x in En we have from (6), 

(9) X) ^ Csm(X). 

Let us denote the Lebesgue measure in En of the set X by m(X). Assuming 
that the function g(x, X) is a pp-measurable function of the pair (x, X) in the 
product space^ of En with T, it follows from Fubini’s theorem^ that 

(10) f g(x, X) dpp(x, X)^ f f g(x, X) dp(X) dp{x). 

JSnXT ^En^T 

^ See S. Saks, Theory of the Integral^ G. E. Stechert, N. Y., 1937, pp. 86 , 87. 
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From (5) and (9) it follows that 

(11) f g(z, X)dp(X) = Pr (XtS(x)) = Pr (x€X). 

Again by Fubini’s theorem we have 

(12) f g{x, X) dMx, X) = f f g(x, X) d^x) dp(X). 

Jg^XT Jr Jg, 

But from (8), 

(13) f gix,X)dp(.c) = f dp(x) ^ p(X). 

Now from (10), (11), (12), and (13) we have 

(14) f Pr (X * X) dpix) = f p{X) dp(X). 

Jg„ Jt 

But the latter integral is equal to E(ji(X)). Hence we have the relation 


(15) E{piX)) = / Pr (x 6 X) dp{x). 

*' Mfi 

This is our fundamental result. We may state it as a 
Theorem: Let X be a random Lebesgue meamrable subset of En , vuth measure 
/x(X). For any point x of En let p(x) = Pr {x tX). Theriy assuming that the 
function g{x, X) defined by (8) is a measurable function of the pair (x, X), the ex¬ 
pected value of the measure of X will be given by the Lebesgue integral of the function 
p{x) over En . 


3. Higher moments of y(X). We may generalize the result (15) to obtain 
similar expressions for the higher moments of y{X), For the second moment 
we have the expression 

(16) FiAX)) = f AX)dp(X). 

Now from (13), 

p\X) = m(X)-m(X) = f g{x,X)dp{x)- f g{y,X)dp{y) 

•'if* •'ifn 

(17) , , 

= / / g{x,Xyg{y,X)dp{x)dp{3i). 

JUn J*n 

Let 


(18) 


g(x, y, X) = g(x, X)-g{y, X) 


1 if Z contains both x and y 
0 otherwise. 
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llien from (16), (17), and (18), we have as before by Fubini's tiieorem, 

£?(A-X’))»/ / / 9(x,y,X)d^^(x)d^(y)MX) 

(i») ' *; 

= I I f g(x,y,X)dp(X)dMx)dM(y). 

Jm, Jtn Jt 

But from (6) and (18) it follows that 

(20) f g{x, y, X) dp{X) * Pr (a: e X and y t X). 

J f 

The latter probability may be denoted by p{x, y). This function will be defined 
over the Cartesian product, , of En with itself. Let nix, y) denote Lebesgue 
measure in . Then from (19) we have 

(21) EiAX))=f p{x,y)dp{x,y), 

•'* 4 « 

where 

(22) p(x, y) — Pr {xtX and y e X). 

The formula for the mth moment of piX) will clearly be 

(23) Exp {p'i.X)) - / p(xi, X 2 , ,Xn)dp(xi,Xi, • • •, Xm), 

where p{xi, Xt, • ■ • , Xm) denotes Lebesgue measure in E„„ and where 

(24) p(xi, Xj, • • • , Xm) = Pr (xi t X and Xs « X • • • and Xm e Z). 

In the next section we shall apply formulas (15) and (21) to a specific problem. 


4. Let a, p, B be given positive numbem such that {B + o)p < a and a < B. 
We shall define the random linear point set Z as follows. N intervals, each of 
length a, are chosen independently on the number axis. The probability 
density function for the center of the tth interval will be assumed to be constant 
and equal to p/a in the interval —a/2 < x < B + (a/2); it may be arbitrary 
outside this interval. The set Z is now defined as the intersection of the fixed 
interval 7: 0 < x < J5 with the variable set-theoretical sum of the N intervals. 
The hypothesis of (15) is clearly satisfied. The probability that any point x 
in the interval I shall be contained in the fth interval of length a is clearly 
(p/a)a — p. From this it follows that 


(26) 


Pr (x « Z) = p(x) = 


1 — (1 — p)^ for 0 < X < S 
0 elsewhere. 


From (15) it follows that 

(26) E(p(X)) = p(x) dx = B(1 - (1 - p)^). 
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(The same fcnmula holds in the case where the N intervals of length a are re¬ 
placed by N circles of area a and 7 by a plane domain of area B, provided that 
for every pcdnt of the domain the probability of being contained in the ith circle 
is equal to a constant p. A similar remark holds for spheres in space.) 

To evaluate E(ji^(X)) in the linear case we make use of the identity 

(27) Pr (A and B) = Pr (A) + Pr (B) + Pr (neither A nor B) - 1, 

which holds for any two events A and B. It follows from (27) and (25) that if 
X and y are any two points of I, then 

p(*> y) = Tr {x eX and y tX) 

(28) = Pr (xtX) + Pr (ytX) + Pr (xiXfmdyiX) - 1 
= 1 — 2(1 — p)^ -f- Pr (x < X and y < X). 

Let 

(29) h(x, y) = Pr (x 4 X and y 4 X). 

Then 

f[l - (p/o)2o]'' = (1 - 2p)", for I p - X I > o 

(30) h(x, y) = [1 - (p/a)(o + |p - a:|)]" = ~ , 


for 1 1 / — a? I < a. 

Now from (21), (28), and (29) we have 

B(AX)) = f f V{x,y)dydx 
Jo ■'0 

(31) = f f [1 - 2(1 - p)" -t- h(x, y)]dydx 

Jo Jo * 

= B*[l - 2(1 - p)"] + 2 r r hix, y) dydx. 

Jo Jx 

When the latter integral is evaluated the result is 

E{i,\X)) = B*[l - 2(1 - p)"] + (B - o)*(l - 2p)^ 

. 2aB(l - p)^"‘ _ 2o(B - o)(l - 2p)^+* 
■(F+l)p " (AT+Dp 

- (i,-+W+ 

Combining this with (26), we find for the variance of ti{X) the expression 
= B(m*(X)) - lB(M(X))r 
(33) = (B - a)’(l - 2vr - B\\ - p)*^ -|- 


2o(B - o)(l - 2p)''+‘ 
{N + l)p 


(iV-ri)(iV + 2)p» 


{N + l)p 

[(1 - pY^ - (1 - 2pY^]. 



ON THE DISTRIBUTION OF THE RADIAL STANDARD DEVUTION 

Bt Frank E. Grubbs* 

Aberdeen Proving Oromd 

1. Introduction. Of interest in the field of ballistics is a measure of the 
accuracy of bullets. In acceptance tests of small arms ammunition lots, for 
example, a sample of roimds from each lot is fired from a fixed rifle at a vertical* 
target placed a specified distance from the rifle. The accuracy of the bullets 
is taken to be some measure of the scattering (or lack of scattering) of the bullet 
holes on the target. The purpose of such a test would be to determine whether 
or not the lot under consideration differs significantly in accuracy from (a) 
standard values or (b) its predecessors. 

One useful measure of accuracy is the radial standard deviation which is 
defined by the relation 

(1) ^ “ /j/^ + S(y,- - f/)*), 

where and yi are respectively the abscissa and ordinate of any point measured 
from an arbitrary origin and N is the sample size. 

It will be the purpose of the present discussion to call attention to a series 
expansion for the distribution of the statistic Z in samples of N assuming that 
the distribution of all rounds of the lot on the target follow the bivariate normal 
population law 

(2) f{x, y) = n - e , (x and y statistically independent) 

^7r<Ti cr2 

where <ri and <rl are the parent variances of x and y respectively. In the above 
probability density function, the population means are taken to be zero since 
the statistic Z is quite independent of the origin selected. 

2. - Moment generating function of Z*. The distribution of si = ^ 2(x{ — if 
in samples of N from a normal population is given by the well-known law, 

JL 

(3) dF(«?) = \^\) ^ 0. 

\ 2 / 

The moment generating function of may be found (in a neighborhood of 
< = 0) by straightforward integration: 

(4) M,x{t) = E(e*J') = e*f‘dF(8?) = |l - 

' Captain, Ordnance Departnlent, Ballistic Research Laboratory, Aberdeen Proving 
Ground, Md. 
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Likewise, for si 


1 

JV 


S(»< - f?)*, we have 


( 6 ) 

Now Mzt(t) = Af,{+,|(<) = = E(e'^*)‘E(e‘^*) since x and y are in¬ 

dependent. Thus, 


Mzt(t) 







3. Distribution function of 2r 




Making use of the Fourier theorem, we have 




at all points of continuity of /(2*). 

The discussion will be divided preferably into the two cases: Case I: cri = <r|, 
and Case II: at ^ at • 

Case I: a\ — trl = a. 

In this case the distribution of reduces to 


(7) 



N ] 


e 


(it. 


Tt will simplify the algebra to find first the distribution of ?/ 
that of Z*. Since = {1 — 

(8) ■ f(u^) = ^ {I - 


NZ^ 

2<r^ 


and then 


This integral may be evaluated easily by the calculus of residues since the inte¬ 
grand has only a single pole of order (JV — 1) at < = —i. We will, however, 
make use of the following method. 

Put —« = «* — iuh; then 


(9) 


/(«*) = 


1 


mV iu* 

2ir . 


—c" 


/*—f**-*tOO 


2in 


dv. 


The integral in the last expression is Hankel’s integral [1]; namely. 

Therefore /(«*) = 
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and <^(2?“) = 1 ) 


.( 11 ) 


dF{Z) = 



r(iV - 1 ) e~^ Z*^~‘dZ. 


(Note that /(Z) is continuous over 0 ^ Z < «.) 

This expected result has been obtained by Reno and Mowshovdtz [2] who 
employed an extension of the famous Helmeii distribution. 

Actually, the result is an obviotis one and may be argued as follows: N^\/a 
is distributed as x 'with N — I degrees of freedom and Nal/o^ is also distributed 

as X* with N — 1 degrees of freedom. Hence, the statistic («* + «*) is, from 

<r 

the additive property of x> distributed like x* with 2N — 2 degrees of freedom. 

We now turn to the general 
Case II: <r\ 9^ al 

No generality will be lost by taking a\ < a\. In fact, the present attack will 
hold with obvious modifications provided a\ < 2^1 . 

Recall that 


(12) /(Z*) 


= ir 

2^7 jLa, 




' N 


€• dtf 


at all continuity points of /(Z^). 

In a manner analogous to that employed by Hsu [3], we replace 

1 - <r\h\ \ 

V-2a\it/N]' 




Further, since 


we may write 

- f'r' 
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Thus, 


(13) f(Z^) 




2\J(JVri) 


2t 


/ oo oe 

E 

00 fMO 


JlJL 


{' - ¥'} 


-(AT+r-l) 


dt, 


1 for r = 0. 


Avith the understanding that r/S = 

We note that the moduli of the terms of the above series are for all t not greater 
than the corresponding terms (tf the following convergent series of positive terms.: 

«\r 


UL 


Therefore, uniform convergence over (—«, «) is established. To show that 
we may integrate over the infinite interval term by term, we observe that 

I S(t) — Sr(t) I < epit) for all t and all large r, where 

S,{t) = the sum of the first r + 1 terms of the series, and the function = 

2<rlitr 


1 — —^*1 which is integrable over (—«, «). That is, Srif) converges to 
/S(0 uniformly relative to ^(0-* Hence, 


(14) /(Z0 = 


= W_ y- V 4 / r/i _ 2^A 


-iZ^t 


dt. 


We have already carried out the integration under Case I with the exception 
that (iV — 1) should now be replaced by (A^ + r — 1). The distribution of Z* 
will then be given by 

_2\r jy_ 

2a\ 


df(Z0 


(16) 




2\5(Ar~l) 00 

z 


r-0 




TiN + r-l) ■ 


NZ* 
~ 2cr? 


/^Y+r- 


d{Z^). 


‘ \2«r?/ 

* The author is indebted to Prof. E. J. McShane for this definition which is due to Prof. 

f OO M 00 

Srit) dt ^ I lim Sr(t) dt, 

00 JLqo r-*oo 
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Finally, the distiibution fimotitsi of^ is 

UI 

We remark that the above series expansion holds, of course, for N odd or even. 
In case N is odd it may be shown that the distribution function may be expressed 
as a finite series of Incomplete Gamma Functions.’ However, the finite expan¬ 
sion for N odd appears to offer no marked advantage since for computational 
purposes the infinite series expansion ccmverges quite rapidly (N either odd or 
even) and may be put into a convenient form given below. 




4. Computational form for the distribution function. In deciding whether 
or not an observed value of Z is significant and likewise in control chart proce¬ 
dure, one is interested in the percentage points of fjZ ), For example, it may be 
desired to find the value of k such that P{Z < ky/a\ -f alj = .995, say, for 
various sample sizes N. In this conne ction it will be convenient to work with 
the distribution of Z*, for P{Z < ky/a\ +^} = P{^ < + vl)j also. 

Now, 

P{Z^ ^ k\iT\ -t- ai)) = / ■ dF{Z^) 




i(iV-l) 00 

£ 

r->0 


/, (K\ 

V vj/_ 

JN - I \r(iV + r- 

i ■ 


''-i! fNZ'^Y 

{m) 


d{Z\ 


since we may integrate the series term by term over the entire range of or 
any part of it [5]. In the terminology of Karl Pearson’s Incomplete Gamma 
Function [3], 


(19) = rlp+^ 

we may write the above series in the form 
P{Z' ^ k\<Tl + cl)\ 


fttVp+i 


== 


0 § 

5/ 


OJL 




+ r-2 

2y/N + r — 1 


• Prof. C. C. Craig kindly pointed out this fact to the author. 
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It is indeed convenient and enlightening that the result is a function of the 
ratio, al/cl, and not < 7 * and/or <r| explicitly. 

Hence, for a given sample size and ratio of v\/o\ , we may find k by inverse 
interpolation such that P{Z ^ ky/eX + <r|j = a, any desired level of probability. 


6. Moments and percentage points for Case I. For the case met manj’^ times 
in practice, i.e. <rl = at — (t®, we will give a table of the mean and standard devia¬ 
tion and also several probability levels which are obtainable directly from the 
percentage points of the distribution [ 6 ]. 

From ( 11 ), we have 

JV2* 


(21) 

^(2") ~ 1) J 

r(JV - 1 - 1 - fc/2)/2<r*Y* 

HN - 1) \N/ • 

Thus, 


(22) 

, _r(Ar-i/2) /2 
r(N - 1 ) y N ’ 

(23) 

/ 2(N - 1) * 

and 

(24) 

2 /.r , rr(iV- l/2)T\_s 

1 |_r(Ar-l)Jj 


In the table below, the mean and standard deviation are given as a multiple 
of \/ 2 <r and fc.w , for example, is that value of k such that P{Z < ky/2(i\ = .95. 


TABLE I 


N 

Mean 

Standard 

Deviation 

Percentage Points 


A;. 06 

k,n 


2 

.6267 

.3276 

.0501 

.1602 

1.2239 

1.6276 

3 

.7676 

.2786 

.1857 

.3442 

1.2575 

1.5738 

4 

.8308 

.2443 

.2906 

.4521 

1.2546 

1.5226 

5 

.8670 

.2198 

.3667 

.6227 

1.2453 

1.4817 

6 

.8904 

.2014 

.4239 

.6730 

1.2351 

1.4488 

7 

.9068 

.1869 

.4686 

.6110 

1.2265 

1.4218 

8 

.9189 

.1752 

.6046 

.6408 

1.2167 

1.3991 

9 

.9282 

.1653 

.5345 

.6651 

1.2087 

1.3798 

10 

.9356 

.1569 

.5597 

.6852 

1.2014 

1.3630 

11 

.9416 

.1498 

.6813 

.7023 

1.1949 

1.3483 

12 

.9466 

.1434 

.6001 

.7170 

1.1889 

1.3353 

13 

.9508 

.1378 

.6166 

.7298 

1.1835 

1.3237 

14 

.9544 

.1330 

.6313 

.7411 

1.1784 

1.3132 

15 

.9576 

.1286 

.6445 

.7512 

1.1738 

1.3038 
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A MATRIX PRESENTATION OF LEAST SQUARES AND CORRE¬ 
LATION THEORY WITH MATRIX JUSTIFICATION OF 
IMPROVED METHODS OF SOLUTION 

By Paul S. Dwyer 
University of Michigan 

1. Introduction and summaiy. It is the aim of this paper to exhibit, by using 
elementary matrix theory, the basic concepts of least squares and correlation 
theory, the solution of the normal equations, and the presentation and justifica¬ 
tion of recently developed and newly proposed techniques into a single, com¬ 
pact, and short presentation. We shall be mainly concerned with the following 
topics: 

a. Basic least squares theory including derivation of normal equations, the 
theoretical solution of these equations (regression coefficients), the standard 
errors of these solutions, and the standard error of estimate. 

b. The more specific theory (correlation theory) resulting from applying the 
general least squares results to the standardized distributions. 

c. A matrix presentation of the Doolittle solution. 

d. A simple matrix justification of methods, previously presented, for getting 
least squares and multiple correlation constants from the entries of an abbre¬ 
viated Doolittle solution. 

e. A presentation of a more general theory which the matrix presentation 
reveals. 

f. The outline of a ^‘square root’’ method as an alternative to theDoolittle 
method. 

The reader should be familiar with elementary matrix theory such as that out¬ 
lined on pages 1-57 of Aitken’s book [1]. 

No previous knowledge of the Doolittle technique is demanded although a 
familiarity with the notation and contents of two earlier papers [2], [3] is advised, 
particularly for those who are interested in the computational aspects. 

The presentation here is theoretical and is not concerned with such compu¬ 
tational topics as the number of decimal places required, etc. With reference 
to the number of places, the reader is referred to the recent paper of Professor 
Hotelling [4]. 

2. Notation. Let [xij] with 1 < i < N and 1 < j < n be the nhy N matrix 
of observed variates of n “predicting variables” for N individuals with i indicat¬ 
ing the individual and j the variable. Let [y\] be the one by N column matrix 
of the observed variates of the “predicted” variable. Let the matrices of devia¬ 
tions from the variable means be indicated by [xa] = X and [y^ = F. Then by 
the least squares hypothesis we are to find numbers byi,.,. , 6 ^ 2 . .., • • • , hyn.... 
such that 

Oi ^ y% (Xiibyi,... “f" Xi2by2»»»* -r|~ * • • "I" ^in^yn*...), 
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shall have a minimu m variation (standard deviation). We then denote the 
by{..,. by the one by n column matrix B and the et by the one by N column mar 
trix E and have 

(1) E ^ Y - XB, 

as the basic matrix equation. 

It may be noted further that the fitted values of yi are given by the one by N 
matrix product XB == Y* Using this notation (1) appears as 

(10 E - Y. 


3. Basic least squares theory. 

a. Sum of squares of residuals. The condition for minimum variation in this 
situation (variates measured from means) is equivalent to the condition for 
minimum sum of squares of residuals. In matrix notation this sum of the 
squares of the residuals can be written 

(2) E'E with E = Y -- XB = F — JT, and J?' the transpose of E. 

b. The normal equations. Differentiating (2) with respect to J8' we find the 
necessary condition to be 

(3) X'E = 0. 

This matrix ec^uation gives the normal equations in implicit form. More ex¬ 
plicitly by (1) we have X'{Y — XB) = 0 so 

(4) X'XB = X'Y. 

The reader should immediately recognize that (4) is the matrix equivalent of 
the usual statement of the normal equations where deviations from the means 
are used. It should be noted also that (3) and (4) can be written in the form 

(5) X'Y = X'Y from whence at once Y'Y = Y'Y. 

c. Solution of normal equations. The theoretical solution of (4) is accom¬ 
plished at once and results in 

(6) B = (X'X)~^X'Y = (X'X)~'XT. 

d. Standard devialion of residuals. The standard deviation of residuals is 

Ve'e/n. 

In order to evaluate this we note that 

(7) Y'E = B'X'E = 0, and TT = Y'Y, 

Thus 

(8) E'E = (F - YYE = Y'E = Y'{Y - XB) = Y'Y - Y'XB, 

and 

(9) Y'XB * Y'Y = TY = Y'Y, 
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Since Y'Y — we have 
( 10 ) 

so that, dividing by N and taking the square root 

«■), 

If the relation between the estimated standard deviations in the population is 
desired then divide each side of (10) by the number of degrees of freedom and get 

Alternative formulas to (11) and (12) are obtained by replacing Y'XB by its 
ecjuivalent expressions in (9). 

6. Formulas for multiple correlation coefficient. It is to be noted that the 
numerical (quantity Y^XB/'Ly^ plays an important role in measuring the ratio 
(ic/(Ty . It is customary to use this quantity as the definition of the square of 
the multiple correlation coeflicient so we have 

/ION .2 -- ^ B'X'XB _ Y'XiX'Xy'X'Y _ Y'Y _ 4 

{6) Tyrr-Xn - ' 2j/* “ .. Yf Sy* <rj' 

/. Formulas for correlation coefficient. When n = 1, X'X == F'X = 
X'F = SXF, B and (13) gives 

fl4) r = = hA/^ = 6 ^ 

Many of the above developments can be duplicated, without formal use of ma¬ 
trix theory, by judicious use of symbolism and substitution. See for example 
the presentations of Kirkham [5], Bacon [6], and Guttman [7]. 

g. Errors of regression coefficients. If Bo is an approximation to B such that 
jBo + AB = B then (6) can be written 

Bo + AS = {X'Xy^X'Y 


and 

(15) AS = (X'Xy^X'iY - XSo). 

This formula can be used in finding corrections AS necessary to change any 
proposed trial solution, So, into a correct solution. It could also be used in 
extending the accuracy of a solution after an approximation had been secured 
to a specific number of places. It has greater utility however in another problem. 
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We suppose that the predicting variables, the x% contain no «toi 8 but that 
there are errors in the observed values oi y. Let tiie hypo<lie;tieal observed 
values ot yh^ indicated by Y and the recorded observed values q[ y hy Yt . 
Let the values of .Bo be the regression coefficients obtained by using the recorded 
observed values Fo,. Thus AB = 0 when Y is replaced by Fo in (16). Now 
let F — XBo, the “true” residual errors of the recorded observed values, be 
indicated by E. Then (15) becomes 

(16) AB = (X'X)"‘X'B. 

Sampling theory can be applied to (16) to obtain a formula for the standard 
error of the regression coefficient. It is assumed that the “true” residual errors 
are independent with a common standard deviation <r,. The values of AB are 
then linear functions of these errors. It follows that 

(17) - ala = (X'X)“‘X'X(X'X)-V* = (X'X)-V*. 

The standard errors of the regression coefficients are thus formed by multi¬ 
plying <Te by the square roots of the diagonal terms of the inverse of X'X. 

4. Standard variates. Use of correlation matrix. Many of the formulas of 
section 3 are simplified with the use of some type of standardization. In par¬ 
ticular it is possible to reduce the matrix X'X to the matrix R of correlation 
coefficients by replacing x by tz/N where 4 = If y is similarly replaced 

and B by B, then X'Y = Rzy and TX = R'zy , FF = V = 1 and selected 
formulas from section 3 become 

(18) RB = Rzy 

(19) ^ B = R^^Rzy 

(20) rlz,,..., = RsyB = B'RB = R'zyRT^Rzy . 

Classical multiple correlation formulas, determinantal and otherwise, are 
^‘covered” by the matrix formulas (20). 

6. Matrix presentation of a Doolittle solution. Least squares and correlation 
constants can also be obtained from the entries of a Doolittle solution. We first 
outline a matrix description of the Doolittle solution of the equation AX = G 
with A = [a< J symmetric and of order n. 

Let Si be a {n by n) matrix with the first row composed of the elements an 
and all other elements 0. Let Ti be a similar matrix with first row eliments 
b\j = aij/an and all other elements 0. Then A — S\Ti = Ai = [a,y.i] is a sym¬ 
metric (n by n) matrix with all elements of the first row and the first column 0. 

Next let Sa be a {n by n) matrix with second row elements oay.i = oa/ — oaibi, 
and all other elements 0. Let T^he a {n by n) matrix with second row elements 
thj.i = and all other elements 0. Then it follows that the matrix 
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Ai — SiTi — [a»;.i 2 ] is a symmetric {n by n) matrix with the elements of the 
first 2 columns and the first 2 rows all 0. 

This process is continued through successive steps, an additional row and 
column being made identically 0 at each step, through n steps. At the end of n 
steps we have the result. 

(21) A - S[Ti - SiTi- ■■■ S'nT„ = 0. 

This development, when applied to each side of the matrix equation, provides 
the basis for an equation solving technique which Aitken has called the ‘^method 
of pivotal condensation*^ (8) but which the author feels is more adequately 
characterized as the ^'method of single division** (9). The Abbreviated Doolittle 
method can be obtained as an abbreviation of this method. It is not neces¬ 
sary^ to compute all the elements of the successive matrices A 1 A 2 • • • , etc. but 
only the non-zero elements of the Si, Ti, S 2 , T 2 • • * etc. matrices. 

Consider the so called triangular matrix S = Si + S 2 + +••• + Sn with 

its rows composed of the non-zero rows of the Sj . Consider also the matrix 
T Ti + To+-• + Tn. Then 

(22) S'T = S'lTi + S 2 T 2 + ••• + SnTn 

since SiTj- = 0 when i 9 ^ j. 

It follows that (21) can be written 

(23) A - = 0. 

An efficient way of building up these matrices S and T in practice and in making 
the corresponding transformations on the right side of the equation is the 
Abbreviated Doolittle method. It is apparent from (23) that the Doolittle 
method is directed, in part at least, toward the factorization of the symmetric 
matrix A into two triangular matrices. 

It should be noted that these triangular matrices are related by the matrix 
formula 

(24) S = DT, 

where D is the diagonal matrix with diagonal elements an, 0221 , Om 12 , 

ann-123-•-n —1 • 

Operations performed on the left of the matrix equations AX = G are also 
performed on the right side so that the Doolittle technique results in the estab¬ 
lishment of the auxiliary matrix equations. 

(26) ♦ SX ^ SA-'^G. 

(26) TX = TA-'G. 

A simple outline (n = 3) of the form of the Abbreviated Doolittle method is 
presented for the purpose of identifying these matrices. A is symmetric and G 
is the column matrix [ai 4 ]. 
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Tlie matrix iS is then 


Oil 

0 

0 


(Zl2 

022*1 

0 


Ol8 
028.1 
083*12. 


, the matrix T is 


“1 

bli 

61 s ■ 


Oi4 


"i>i4 ■ 

0 

1 

?>281 

, SA-^G is 

024-1 

, TA-'G is 

&241 

_0 

0 

1 


__084.12_ 


J>9An_ 


6. Least squares and multiple correlation constants from the Doolittle solu¬ 
tion. The inverse of A is needed for many formulas. We set up a technique 
for solving AY == I simultaneously with AX — G. This is indicated sym¬ 
bolically by 


A 

G 

I 

S 

SA-^G 

SA-^ 

T 

TA-'G 

TA-^ 


It follows at once that 

(27) = A-^S’TA-^ = 

This matrix multiplication is easily and readily accomplished when the matrices 
are in the Doolittle form. 

Similarly 

(28) {SA~^y{jrA~^G) — A~^G = X, the matrix of solutions of AX = G and 

(29) {SA-^G)'{TA-^G) = G'A’^G. 

It is interesting to note further that (SA~^)' and T are inverse triangular 
matrices since 

(30) {SA-yr = A~^S'T = I. 

S' and TA~^ have a similar relationsliip. 
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In the C5ase of least squares theory A = X'X, G = X'F, X * so that the 
formulas (27) (28) (29) become 

(31) == {X'Xy\ 

(32) (SA-^yiTA-'^G) ^ X ^ B. 

(33) (SA-'(?)'(rA“'G) = - F'X(X'X)^'X'y 

= Y^XB = B'X'X£ = JT. 

If the normal equations are reduced to standard form -4 = iB, X = B, G = R^y 
and we have 

(34) {SA^y{TA^^) = « Br\ 

(35) (SA“')'(r^~'G) = X = B. 

(36) (SA"'G)'(2rA“'G) = G'A’-^G = R'^^yRT^R^ = /g'^B = B'igB = 

The reader is referred to an earlier paper [3, 457] for an illustration of these tech¬ 
niques. 

It should be noted that the solution is a cumulative one in the sense that solu¬ 
tions involving n predicting variables are obtained from solutions involving 
n — 1 predicting variables by the addition of paired products. This is a highly 
desirable feature as it makes possible direct analyses showing the effect of an 
added predicting variable. 

7. A more general theory—solution of matrix equations by factorization* 

Examination of the results of section 5 leads one at once to a consideration of a 
more general theory. The key formula in this development is A — S'T = 0 
and all subsequent formulas stem from this. Hence if A can be factored into 
any matrices, /S' and T, not necessarily triangular, the results of section 6 follow. 

From a practical standpoint it is desired that the factorization process yield, 
simultaneously, the values /S, T, SA~^G\ TA~^G; >SA“^ and TA~^ as the Doolittle 
method does. But, formally, these can be computed if S and T are known. 

8 * A ^^square root” method. A most interesting and practical special case 
of the above method is that in which the triangular matrices S and T are equal. 
It appears that a technique based on this property would have some advantages 
over the Doolittle method since the double rows of the Doolittle solution could 
be replaced by single rows, while the formulas of sections 5 and 6 are just as 
applicable. Now such a technique is easily devised. From (23) and (24) we 
see that 

(37) A - S'D-^S = 0, 
where D is a diagonal matrix. 

We replace D^S by a new /S, (D“^/S)' by a new S' and have 

(38) A - S'S = 0. 
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The teduiique of sdutio n is s imilar to that of the Dodittle except that the 

entries are aij—./y/ajj . These values are thus geometric means 

of the values an.... and hn ..... 

A simple machine technique is available for computing these entries. In 
some respects tire solution is superior to the Doolittle solutimr. It is hardly 
pertinent to the subject matter of this paper to presmit a detailed discuslion of 
the merits of this method, with the numerical illustrations. This will be dtme 
in a later paper. 

After arriving at this method by the steps described above, it seemed surpris¬ 
ing that such a simple and compact method has not been discovered by some 
previous worker. Although matrix factorization is not a new subject, I have 
not found evidence that it has been utilized so directly in the problem of solving 
matrix equations. The nearest approach I have discovered is the paper by 
Banachiewicz [10], in which a “square root” method is used in factoring A. 
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ON THE STATISTICS OF SENSITIVITY DATA 

\ 

By Benjamin Epstein and C. West Churchman 
Frankford Arsenal 

1. Introduction* “Sensitivity data^* is a general term for that type of ex¬ 
perimental data for which the measurement at any point in the scale destroys the 
sample; as a consequence, new samples are required for each determination. 
Examples of such data occur in biology in dosage-mortality determinations, 
in psychophysics in questions concerning sensitivity responses, and, more 
recently, in the theoiy of solid explosives, in questions concerning the sensiti\nty 
of explosive or detonative mixtures. 

Methods of analyzing such data have been discussed by Bliss' and Spearman^, 
and others. The present paper is a generalization of Speaiman’s result; it is the 
feeling of the authors that Spearman^s method, if properly founded in mathe¬ 
matical theory, is preferable to Bliss', for it does not necessitate the assumption 
of some type of distribution prior to analysis, and hence resembles the standard 
treatment of independent observations made on the same object. 

Throughout the following discussion, we let Xi be the magnitude of a certain 
“stimulus" (be it dosage, physical stimulus, or strength of blow) and p* the cor¬ 
responding fraction of objects unaffected by the stimulus. Bliss’ method con 
sisted in assuming that the p, represented the cumulative distribution of some 
known function (in his case, the normal function), and hence the p» could be 
transformed into a variable U linearly dependent on the Xi . The difficulty of 
this treatment, in addition to the distribution assumption, lies in the fact that 
the ti do not have equal standard errors, and the straight line fit is verycumber- 
.some. 

Instead, Spearman makes the much simpler assumption that if pi is unaffected 
at Xi , and p,^.! at , then p,- — pj+i is an estimate of the fraction that is just 
affected (i.e., the fraction of thase that have “critical" responses) at about 
^(xt + Xi+i). If the Xi are evenly spaced, as we shall assume them to be through¬ 
out, and Pi = 1.0 and pn = 0, then any set of sensitivity data may be trans¬ 
formed into a set of data on critical responses classified into classes whose mid¬ 
points are evenly spaced. Without loss of generality, we shall assume the Xi’s 
to be integers and the intervals to be unity. The data on critical responses can 
then be treated in the normal way, and and all the measures of dispersion 
calculated in the usual fashion. In order to justify such procedures, however, 
it is necessary to show how the sampling errors of Jt and the higher moments 
can be estimated. 

'C. I. Bliss, “The calculation of the dosage mortality curve,” Annals of Applied Bi¬ 
ology, Vol. 22, pp. 134-167. 

* C. Spearman, “The method of ‘right and wrong cases’ (constant stimuli) without Gauss’ 
formulae,” British Jour, of Psych., Vol. 2, 1908, pp, 227-242. 
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2. The monwnts and tiieir errors. By definition, 

(1) -2? = 21 (Pi - Pi+i)(a‘< + Xi^i)/2 = 2 (Pi - Pi+i){*< + i). 

t -1 .. •-1 

If we let x\ represent the stimulus for which none of the samples can be affected, 
then 

(2) X = + .5 + ]C Vi > 

t-2 


as Spearman has shown (3). Since x\ is constant, and the Pi are all independent 
(non-correlated), it follows that (Ni being the number of objects in the ith 
sample) 


(3) 


2 _ 2 _ 2 
^Pl 


+ <Tpt + 


+ = 21 ' 


— V ?Ll' 

Ni 


(ance — 0). 

Again by definition, the gth moment about the origin is 


(4) 


M9 


n 



(pi - Pi+l)iXi + i)*. 


As before xi + .5 can be taken as the origin {xi + *5 = 0), in which case we have 

= (Pi - P2)-0* + (P 2 - Pa)-!* + (P3 - Pi)-2^ 

+ • • • + (Pn-i - P«)(n - 1)®. 

If we let h^,i represent the xth first difference of the consecutive gth powers of the 
positive integers (including 0), then 

n —1 

(6) Mo ^ ^0.* P» > 

by expansion of (5). Hereafter all - will be taken from i = 2 to i = n — 1. 
FAidently 



We are interested now in the standard error of the gth moment about the sample 
mean. To obtain this, compute first the correlation betw^een the gth and rth 
moments about the origin. 
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If Sug is taken to be the sample error in ixq due to deviations Spi from the true 
values, then we have 

n«-l 

n-1 

^Mr = Z) ^r.t* dpi . 

taa2 

Hence 

= L •(«?.)* + Z {b,.ibr,i + b,.ibr.i)SPiSpi . 

*f*J 

Summing for all samples: 

~ Z^ bg,ibr,iO‘p^ ~t" {bq,ibr,i "H bqjbr,i)(<J‘piffp^rp^pf) 

(9) 

^ Z) bq,ibr,i(rp^ . 

Since evidently Vp.p. vanishes for all i ^ j (the pi being completely independent 
in the statistical sense). 

In particular, when Hr = mi = we have 

(10) = 26i,i<rp^ . 

By definition, the ^h moment about the mean will be 

(11) M9 = 2(Pf - PHi){xi + i ^ xy ^ ^piixi + ^ - xy 
where Pi = pi - p.+i. 

For computational purposes, this may be written as 

(12) p, = + • • • + ,C.^V,-r+i + • • • + ^* 

where X = Ipi = mi , if 3:i + i is the origin. 

To obtain , where X is estimated from the sample, we may follow the usual 
procedures, arguing that 

(13) 6m, = 2{(Xi + mpi] - q^X^ixi + + T 

where T contains terms invohing X and higher powers of X. 

From (13) we obtain 

(14) = (tJ' + <1 tiqJxtTi — ZgMi-io'jo'p'rx^' + U 

where U involves X and higher powers. From (3), (8), (10) and (14) we have 

(15) ^qH-y^bq,i(rp^ + U 

= s(5,.i - gM^Vp, + u. 

We now shift the origin to X. All the terms in U vanish, mUi becomes m,-i , 
and the bq,i values go into &q,i , where 

^q.i « {i - xy - (i --1 -- xy. 
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m 

That is, (16) beoc«nes 

(16) < » S03,.< - . 

It is of interest to give an alternative proof of the relation (16) possessing the 
desirable property of being very short and simple and at the same time yielding 
an expression for the jSj.i in terms of 6r,<(l < r < g) and powers of J?. 

If xi + .5 is taken as the origin, then (11) may be written as (17) 

(17) M, - 2(* - Xypi. 

The application of the d-operation to both sides of (17) yields: 

(18) a/i, = i(i - SYdpi - qx(i - Jr)*-VJ«X 

(19) = 2 (d,,,- — qiif-i)6p {. 


Repetition of a previous argument gives the result: 

(20) <r*, = 2(/38.i — . 

In order to derive the relation connecting the with 6r.<(l < r < g) we 
expand S(t — ^)*5p< in equation (18). This expansion yields: 

Z(t - XYipi » 2(t» - ^CiY-^X + 

+ • • • + (-!)•-* + {-iyX*)8p'i 

= 23 (^«.< - tClbq—l,iX + qCibq~.t,iX* 

+ ... (-i)«->gX«-»6,,<)api 

i.e., 

= bq.i - qClbq-l.iX qC2bq^i.iX^ 

+ ... + (-l)«-‘g^«-‘6M. 


The relationship (16) combined with (22) enables one to compute the standard 
errors of a number of useful statistics. In particular in case g = 2 it follows that 

(23) • = <rU = 2(6^., - 2^)V„ . 


Combining (23) with the well-known result that 


(24) 

we see that 

(25) 


= <rM,/ 2 <r 

2v' 23 (2i - 3)p< - (22 p.)* 


Formula (25) is useful in significance tests involving the standard deviations of 
sensitivity data. 


3. Standard errors of the moments in standard units. We now turn our at¬ 
tention to the derivation of the standard error of the higher moments when 
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expressed in standard units. Before proceeding with the derivation it is con¬ 
venient to find the correlation l^tween the gth and rth moments about the mean. 
This result is an immediate consequence of (19), for since 

— qfig^i}dpi 

and 

(26) 8nr == S{i3r,* — 

it follows that 

(27) 8txg8iir = 2{/5,,t — — riiT-.i](8piY + Z 

where Z contains terms dpibpjii 9^ j). Hence, as before, 

(28) 1) {^r,t • 

I^et us now derive the standard errors of the moments in standard units, i.e., of 


(29) 

Now in general, 

* 

b 

11 

(30) 

(T^8y.q ~ qc^'^^iiqb(T _ (rbfXq — qtigba 
<T2ff (fff+l 

and since 


(31) 

5/X2 = 2<t 5(7 , or 5(7 == biJi 2 f 2 a- 

we have 


(32) 

2cr^bnq — qfig b^2 
■ '2cr»+= 

and hence 


(33) 

ft ^2 4<r*iSngf + ~ 4qa'iig5ngdni 

~ 4^S(«+2) 

(34) 

i 4#t5<r*, + 

4Mr 

In this case, it follows that 

(35) s 

4m2 Z) (j3«.< “ 




or 


4/ir 


( 36 ) 
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If the gth moment about the mean vanishes, then 
/,7x s _ 4 ms ]C “■ 


Ei sz ^ 


It is readily seen that the standard errors of the skewness and flatness are special 
cases of formula (36) when g = 3 and g = 4 respectively. 

4* Some minimization problems. In the analysis of sensitivity data it is most 
desirable to minimize or alt in order to increase the precision of significance 
tests involving X or a respectively. Therefore, it is of interest to solve the 
following problem: Suppose that we have a sample of size N which is to be sub¬ 
divided into n samples of size Ni to be tested at a number of fixed levels {x<}, 

n 

i = 1, 2 • • • n, A'', = iV. Tlien w'^hat choice of values {A^,} will minimize 

cTi = , where 23 = N? 

In order to solve this problem most quickly we use the method of Lagrange 
multipliers, i.e., we minimize the expression 

(38) Lm , X) = g + x(g Ni-N^. 

Taking the partial derivatives with respect to AT* we obtain the n equations 


(39) = = 

Summing over all values of i we obtain 

(40) 


t = 1, 2 


HVnqi 

t—1 


i.e., the best choice of values for jAT,} is given by 

(41). Ar,.£v&. 

2 V PiQi 

The value of (r| for this choice of the set {A",} is 




It is obvious that this is actually a minimum. In particular, it is less than the 
value of vj for iV< = iV, = N/n (the number of groups is n). This follows from 
the application of Schwartz’ inequality to (42), for 


( n _\ 2 n 

JlVpiqi) « Ep.?. 

4-1 _^ 


^ N 


(43) 
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which equals the value of for iVi » ATs ■= 
holds if and <Mily if pi == p* = • • • = p* . 
Suppose next that we wish to minimize 

(44) 

iml 

where 


• »= JV» = N/n. The equality 


XT' Pi 9i 

Imd tTW > 
f*i TV* 




We proceed as before to mii^imize the expression 

(45) - ± + x(g at: - at). 

Taking partial derivatives with respect to N't we obtain 


(46) 


fihpiqi 


X i.e. N'i 


I jOt,.-1 V PiQ i 

\U2 


1,2, ,n 


or summing over all values of i we obtain 


(47) = or f lAilVPiii. 

«i.l t-1 iV 

i.e., the best choice of values for {N'i] is given by 

(48) 


_ N I d2.< I VPiqi 
]C |ft.. |Vp. 8.- 


The minimum value of is given by 


(49) 


1 ft.t I V PiQi^ 


N 


In practice we desire a set {Ni] which will make aj and alt small simultane¬ 
ously. Unfortunately this is not in general possible. In fact, it may be asserted 
that the set [Ni] minimizing, <r| will yield a large value of <rji and similarly the 
set {Ni 1 minimizing alt will yield a large value of <r|. The reason for this curious 
behavior lies in the fact that the only difference between the set {iVt} and the 
set {Ni] is the set of numbers { | ft.t 1 } = I | (2t — 3) — 2X | }. These num¬ 
bers, however, change the character of the sets {Ni] and {Ni], In particular 
{Ni] takes on its largest values for both small and large values of i, whereas 
{Ni] takes on small values in these regions; {Ni] takes on small values for those 
values of i which are the integral values closest to .X -h 3/2, whereas {Ni] takes 
on large values for such values of t. It is this curious juxtaposition of {Ni] and 
{iVi} that renders it impossible to choose sets of numbers {Ni] minimizing cr| 
and all simultaneously. 
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This section is devoted to bri^ research and expository artides, notes on jnetkodolo^ 
and other short items. * t 


NOTE ON RUNS OF CONSECUTIVE ELEMENTS 




By J1 WoLFOwrrz 


Columbia University 


In my paper [1] I did not derive the asymptotic distribution of an omis* 

sion which I wish to correct in this note. 

Let the stochastic variable /? =* (o^i, • • • , Xn) be a permutation of the first n 


positive integers, where each permutation has the same probability —,. A sub¬ 
til 


sequence Xi^i , a:,+ 2 , • * • , Xi^i , is called a run of consecutive elements of length 
I if: 

a) when V is any integer such that !<?'<?, 


I Xi^v — ^t+r+11 = 1 

b) when t > 0, | x, — x,+i | > 1 

c) when i + I < n, \ Xi^i — x,+/+i | > 1. 

Let W{R) he the total number of runs in /?. Then n — W(R) is a stochastic 
variable which ^ it will be shown ^ has in the limit the Poisson distribution with mean 
value 2, More precisely, if p{w) is the probability that n — W{R) =» Wy then 


( 1 ) 


1 “ 


Proof: Define stochastic variables = 1, 2, • • • , n), as follows: yi = 1 if 
Xi is the first element of a run of length 2, y, = 0 otherwise. It is easy to see 
that the probability that xS == 1, 2, • , n) be the initial element of a run of 

length greater than two is 0 and hence that the probability of the occurrence 

of a run of length greater than two is 0 . Hence the limiting distribution of 

n W{R) is the same as that of 

n 

y == H Vi f 


provided either exists. 

The yi are dependent stochastic variables and almost all (i.e., all with the ex¬ 
ception of a fix^ number) have the same marginal distribution. We now wish 
to consider the expression 

••• y?/) 

(where the symbol E denotes the e;q)ectation) for any set of fixed positive in¬ 
tegers k, at, • • • , at, and for aU As-tuples it, it, • • • , it, with no two elements 

97 



ALBEBT H. BOWKEB 


equal. Now 

EivVvV * * • Vii) * E(yi,yi, • • • y.J 

is the probability that 2/»i = 1, i/<, = 1, • • • , == 1, simultaneously. This 

probability is either zero (for example, when 1 12 ~ 1 = 1, | — t* | =» 1, etc. 

or when ii = n, etc.) or + 0 • Moreover, the ratio of the number of 

A^tuples ii , i 2 , • • * , ik for which the probability is zero to the number of fc-tuples 

for which the probability is + 0 . Let Z*(t = 1, • * • , n) be 

independent stochastic variables each with the same distribution such that the 
probability that Z* = 1 is 2/n and the probability that Z,- — 0 is (n — 2)/n. 
It follows readily that the limit, as n oo, of the jth moment (j = 1, 2, • • • , 
ad inf.) of y about the origin, is the same as the limit of the same moment of Z, 
where 

z = i^Zi. 


Since the Zi are independently distributed, and since each can take pnly the 
values 0 and 1, the probability of the value 1 being 2/n, the jth moment of Z 
about the origin approaches, as n oo, 


« 'JO' 

-2 V ^ 

1 1 


which is the jth moment about the origin of the Poisson distribution with mean 
value 2. By the preceding paragraph, y.j is also the limit of the jth moment of 
y about the origin. Now von Mises [2] has proved that if the jth moment 
(j = 1, 2, • • • , ad inf.) of a chance variable Zn, (n = 1, 2, • • • , ad inf.), ap¬ 
proaches, as n —> 00 , the jth moment of a Poisson distribution, then the dis¬ 
tribution of Xn approaches the Poisson distribution with corresponding mean 
value. From this it follows that y has in the limit the distribution (1). We 
have already shown that y and n — IF(72) have the same limiting distribution, 
so that the required result follows. 
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NOTE ON CONSISTENCY OF A PROPOSED TEST FOR 
THE PROBLEM OF TWO SAMPLES 

By Albert H. Bowker 
Columbia University 

Certain tests for the hypothesis that two samples are from the same popula- 
tioh assume nothing about the distribution function except that it is continuous. 
Since the power functions of these tests have not been obtained, optimum 
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tests are not known. However, one desirable^ test property, that of 
ency,'' has been introduced by Wald and Wolfowita [1]. .A test is called con¬ 
sistent if the probability of rejecting the null hypothesis when it is false (the power 
of the test) approaches one as the sample number approaches infinity. Tli^ is a 
logical extension of the familiar idea of consistency introduced by Msher. It 
will be shown that a test recently proposed by Mathisen [2] is not consistent 
with respect to certain alternatives. 

The test proposed by Mathisen [2] may be described briefly as follows: Given 
two samples,' observe the number (m) of elements of the second sample whose 
values are less than the median of the first sample. The distribution of m is 
independent of the population distribution under the null hypothesis. Let 
P{m < a] denote the probability of the relation in braces under the null hy¬ 
pothesis. If m\ and Wo are significance points (mi > m^) such that 

P{m > mi} = 

(1) P{m < mo} = di 

/3i + /32 = iS < 1, 

the statistic m can be used to test the hypothesis at the significance level jS. 
This is called the case of two intervals. The method is extended by using the^ 
two quartiles and the median of the first sample to define four intervals into 
which the elements of the second sample may fall. If the second sample is of 
size 4n and the number which actually falls in each interval is rii, %, ns, and 
riA respectively, the distribution of 

(«■• - nf 


is also independent of the population distribution under the null hypothesis. 
Then if C* is a significance point, such that 

(3) P\C > C*} = < 1, 

C can be used as a test of the hypothesis at the level 0'. 

To show that Mathisen's test is not consistent, we shall consider fii'st the case 
of two intervals. Let A" and Y be two independent stochastic variables whose 
cumulative distribution functions F{x) and G{x) are continuous. Let 
< 0:2 * • < X 2 n+i and 2/1 < 2/2 ’ <y 2 n be sets of ordered independent observa¬ 

tions on X and F. Then m is such that 

2/m ^ ^ 2/»w+l • 

Let mi and m 2 be the significance points of the distribution of m, defined by 
(1). Clearly mi and m 2 depend on n. We shall prove that the sequence 

(4) .-1,2, 


^ For large samples. 
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.converges to Since (4) is bounded, it has at least one limit p<^t. Let h 
be such a limit point. If A < ^ and i — h SS, then there exists a monotonically 
increasing subsequence of the integers ni, n*, • • • and a number N such that 
for fii > N 


( 6 ) 


2ni 


- h 


< S. 


Clearly m/2n converges stochastically to §. Hence if 0 < « < 1 is any arbi¬ 
trarily small number, we can select n so large that the probability is at least 
1 — e that 


( 6 ) 


m 

2n 



S. 


Hence for n sufficiently large, P{m > mi} is at least 1 — a contradiction with 
(1). A similar contradiction appears if ft > Hence (4) has only one limit 
point, In the same way we can prove that the setjuence 


(7) 


maCn) 

2n 


n = 1, 2, 


also converges to 

Let 0 < d < I, Consider now two pairs of populations, A and B, described 
as follows: 


A) 

Fix) s Gix) = a: 

(0 < X < 1) 

B) 

Fix) s X 

(0 <x < 1) 


O 

HI 

1 

VI 

VI 

o 


Gix) = (x ~ J + 23)(J — B)/d 

a - 2S<x<j-S) 


Gix) s X 

ih — 


Gix) s ^ 4“ 5 

(§ + « < X < 1 - 5) 


Gix) s (J 4- 6) + (x — 1 + 5)(i — d)/d 

(1 - 5 < X < 1) 


For both A and B, F{x) s G(x) s 0 for x < 0 and F{x) ss G{x) s 1 for x > 1. 
For B, it will be shown that there exist values of n greater than any preassigned 
arbitrarily large number, such that the probability of rejecting the hypothesis 
when it is false is less than + ^2 + € where c is an arbitrarily small positive 
number. 

Let hi , h 2 , hz denote the number of observations on X which fall in the inter¬ 
vals 0<x<J — 5, J — 6<x<i + 5, i + 1 respectively for a fixed 

value of n. Let h [, Jh , be the corresponding numbers for F. For a fixed 
n, the probability of a set fei, , ft®, fej, , hz is the same whether the samples 

be drawn from A or B. From (4), (7), and the stochastic convergence of m/2n, 
it follows that we can find an N such that for all n > AT the probability is at 
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least 1 — c/2 of the occurrence of a set fej, ^ 2 , fes, fex, A*, for i 

Xn+i , Vmt will fall in the interval (i — 1, J + 5), Fuiihermore, for fixced fc*, 
ht the distribution within the interval is the same whether the sample came frtoi 
A or B. Hence, even when the sample is drawn from B, for n sufficiently large, 

P{m > mi] < ft + ^ 

P[m < mj} < ft + 

That is, for samples of sufficiently large size from B, the probability of rejecting 
the null hypothesis is at most iSi + ft + c. Since fii + 02 < I and c is arbitrarily 
smaU, the probability can be made less than one and the test is not consistent 
in the case of two intervals. 

In the case of four intervals, the proof is similar. In this case, we assume 
that the second sample has size 4n. Clearly, ni/4n, n2/4n, n8/4n, and ni/^n 
converge stochastically to J. If C* is the significance point defined by (3), the 
sequence 

C*(n) n = 1, 2, •. • 

converges to zero. Now consider two pairs, A and B, of populations. A is the 
same as before and B consists of one uniform distribution and one which is 
identical with the uniform distribution in small intervals containing a; — i, J, 
f, and 1, but is different everywhere else. As before, F(x) = G(x) s 0 for 
X < 0 and F(x) s G(x) = 1 for x > 1. Then for B, when n is large, the be¬ 
havior of C, except for a probability arbitrarily near zero, will depend only 
on the intervals of coincidence. Hence for B 

P{C > C*} < ft + € 

where e is any arbitrarily small positive quantity. 

Returning to the case of two intervals, if the samples are from different popula¬ 
tions and if their cumulative distribution functions are identical in the neighbor¬ 
hood of their medians, the test is not consistent. If such a possibility is excluded 
from the class of admissible alternatives, we may expect that the test will be 
consistent. For example, if the class of alternatives is limited to those where 
G(x) s F{x + c), c a constant, the test vill be consistent. A similar remark 
holds for the case of four intervals or for any fixed finite number of intervals. 
It appears, however, that if the number of intervals is a function of the sample 
size (say \/n) and becomes infinite with sample size, a test of this kind will be 
consistent with respect to .a general class of alternatives. 
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HENRY LEWIS RIETZ—IN MEMORIAM 

By a. R. Crathorne 
University of Illinois 

Forty odd years ago few if any American college catalogs mentioned the words 
‘‘mathematical statistics.” The word “actuary” often called for the use of a 
dictionary. Some courses in the theory of probability, theory of errors, or 
method of least squares touched on some phases of statistics but aside from this 
there was little interest in the subject. In England at this time Karl Pearson 
was well started in his work at University College but “Student” was an under¬ 
graduate student. In Germany, Lexis was finishing his somewhat unrecognized 
labors at Goettingen. In Denmark, Thiele, and in Norway, Charlier were lec¬ 
turing and writing on statistics from their own individual viewpoints. 

During the four decades which have passed, the interest in theoretical statistics 
in the United States has increased to the point where it has a well established 
journal of its own and few university mathematical departments fail to list 
statistical courses. In this growth no one has had more influence than the sub¬ 
ject of this memoir. His published papers, his personality, his students and his 
well directed energy have all been more than helpful in putting mathematical 
statistics where it is today. 

Henry Lewis Rietz, son of Jacob and Tabitha Jane Rietz, was bom August 24, 
1875 at Gilmore, Ohio. He attended the local schools and in 1895 entered Ohio 
State University receiving his B.S. degree in 1899. After graduation he w^ent 
to Cornell University as scholar, then fellow and assistant in mathematics. 
During his stay at Cornell he was closely associated with two other mathematical 
students, J. W. Young and H. W. Kuhn, later heads of the departments of mathe¬ 
matics at Dartmouth and Ohio State University respectively. In his last year 
Rietz was particularly interested in group theory and worked for his doctorate 
with Professor G. A. Miller who was then a member of Comeirs faculty. His 
dissertation was “On primitive groups of odd orders,” published later in the 
American Journal of Mathematics and referred to in the Encyclopedie des Sciences 
Mathematiques, After receiving the Ph.D. in 1902 he spent one year as professor 
of mathematics and astronomy at Butler College in Indianapolis. 

In 1903, Rietz accepted an instructorship at the University of Illinois where he 
stayed until 1918 becoming full professor in the meantime. In 1918 he was 
called to the University of Iowa as head of the department of mathematics, a 
position he held until his retirement in 1942. 

During his first year at Illinois his interests were mainly in pure mathematics. 
His advanced courses were “Theory of Invariants” during the first semester and 
“Higher Plane Curves” during the second. During the next year a demand 
arose for some course in statistics. None of the members of the mathematics 
department were particularly prepared to give such a course but Rietz was 
induced to try it. The result was that he offered a course “Averages and 
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Mathematics of Investment.’^ This curious title was ervidence of the fact that 
actuarial science had not reached the independent position that it has at the 
present time. In the following year he was appointed to the position of statisti¬ 
cian of the College of Agriculture and from that time on during his stay at 
Illinois he divided his time equally between the department of mathematics and 
that college. His work as statistician was mainly supervision of the statistical 
work in the published bulletins. The first publication in the statistical field 
under his name was the 32-page appendix to Dean Davenport’s treatise on 
breeding. 

The first published statistical study was in 1908, a master’s thesis for Miss 
Shade on ^‘Correlation of efficiency in mathematics and in other subjects,” 
printed as one of a series of University Studies. It is interesting to recall the 
attention which this paper received, especially from educational circles. It 
seemed to fix the method and form of calculation of correlation coefficients which 
occupied the time of many people during the following years. In these early 
years it was rather difficult to find a place of publication for a mathematical 
paper on statistics. Mathematical journals were somewhat reluctant in accept¬ 
ing articles. I remember one occasion when he jokingly complained of the 
correspondence necessary to explain to an editor the word “correlation” vfj^d 
in a paper. 

From 1908 on, Rietz published a long list of papers on statistical topics, some 
purely theoretical, some expositional, some arising out of his connection with the 
college of Agriculture. Together with his later actuarial studies the list totals 
150 titles, the more important of which are included in this article. His much 
quoted paper of 1920 on “Um Schemata” pleased him more than any other paper. 
His little book Mathematical Statistics, one of the C’^arus mathematical mono¬ 
graphs, written in 1920 was the basis for many university courses for years 
afterward. 

In 1909 the American Institute of Actuaries was organized in Chicago, and 
Rietz was a charter member. He took particular delight in this organization 
and was rarely absent from the meetings. He was elected vice-president in 
1919. He liked meeting practical actuaries and had a wide acquaintance among 
them. In 1916 he was appointed a member of the Illinois Pension Laws Com¬ 
mission and became its actuary. From that time on his interests were pretty 
evenly divided between mathematical statistics and actuarial problems connected 
^^^th pensions. He was appointed actuary of the Chicago Pension Commission 
in 1926, was consulting actuary for the Presidents National Committee on 
Economic Security 1934, and was a member of the board of trustees of the 
Teacher’s Insurance and Annuity Association 1934-38. His services as a con¬ 
sulting actuary were sought by a great many pension projects both in educational 
and in business circles. When he Avent to the University of Iowa in 1918, he 
accented actuarial theory in his teaching. Under his leadership the department 
became an outstanding school in this field. Many of his students hold prom¬ 
inent positions in the actuarial Avorld. 
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In 1923, Rietz with eight others was appointed a member of the Committee 
on the Mathematical Analysis of Statistics of the division of Physical Sciences 
of the National Research Council. The work of this committee developed into 
the preparation of the “Handbook of Mathematical Statistics’’ with Rietz as 
Editor-in-Chief. This work had considerable use for a number of years after its 
publication and is an important part of the history of mathematical statistics 
in this country. A Russian edition of this book appeared in 1927 with a very 
long preface as a sort of apology for the translation. A few excerpts from this 
preface are—“Mathematical statistics is a purely technical weapon, politically 
unbiased, can serve with equal facility either to thwart or to expedite the move¬ 
ment for the emancipation of the proletariat depending in whose hands it happens 
to be;” “Mathematical statistics has notliing to do with philosophical enlighten¬ 
ment;” “Hence this book harbors no dangers for a soviet reader;” “The fact that 
the western authors work in a bourgeois society has no bearing on their methods.” 

The Institute of Mathematical Statistics was organized in 1935 with Rietz as 
the informal chairman of a steering committee during the months of discussion 
preceding the organization. He thus became the logical first president. He 
has taken a more than active interest in the Institute,—^as a contributor to 
the Annals, as one of its editors, as general counselor, as a good friend. In 
appreciation of this and in recognition of his contributions to the initiation and 
development of mathematical statistics in America, the 1943 volume of the 
Annals of Mathematical Statistics was dedicated to him, on the occasion of his 
retirement after twenty five years of service as head of the department of mathe¬ 
matics at the University of Iowa. 

Professor Rietz received many honors in other fields. He was President of 
the Mathematical Association of America in 1924, vice-president of the American 
Statistical Association in 1925, vice-president of the American Mathematical 
Society 1928-9, and a member of the editorial staffs of the Bulletin and the 
Transactions of that society for many years, president of the Iowa Academy of 
Science, 1931. He was starred in American Men of Science, a fellow of the 
Royal Statistical Society of London, and of the American Association for the 
Advancement of Science. He took great interest in local affairs and held many 
offices in church, social and business organizations in Iowa City. His mind was 
not altogether centered upon research and the development of mathematics and 
statistics. He took great pride in his teaching. He was the principal author 
of a number of college texts in mathematics that had wide use, and was on many 
committees concerned uith the problem of teaching mathematics to under¬ 
graduates. 

At the time of his retirement. Professor Rietz was in failing health and was 
practically an invalid until the time of his death at the University Hospital at 
Iowa City on December 7, 1943. He leaves a brother. Professor John Rietz, 
Morgantown, West Virginia, and a sister, Mrs. T. S. Taylor, Caldwell, New 
Jersey. 
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SBLBtiTED Titles of Pubucations by H. L, Eibtz 

1. primitive groups of odd order,** Amer. Jour, of Math., Vol. 26 (1904), pp. 1-30, 

2. “On groups in which certain commutative operations are conjugate,** Tram. Amer, 

Math. Soc., Vol. 5(1904), pp. 600-508. 

3. “Simply transitive groups which are simple groups,** Bulletin Amer. Math. Soc,, Vol. 

11(1905), pp. 545-46. 

4. “Statistical Methods. Appendix to Principles of Breeding.** A Treatiee on Threm* 

matology by E. Davenport, Ginn and Co., Boston, 1907, pp. 681-713. 

5. “Correlation of efficiency in mathematics and efficiency in other subjects—A Statistical 

Study,** Rietz and Shade, The University Studies, Urbana, Illinois, November 
1908, 20 pp. 

6. Statistical Methods Applied to the Study of Type and Variability in Corn, Eugene Daven¬ 

port and Henry L. Rietz, Bulletin No. 119, Illinois Agricultural Experiment 
Station, 1907. 

7. “On inheritance in the production of butter fat,** Biometrika, London, Vol. 7(1909), 

pp. 106-126. 

8. “On a mean difference problem that occurs in statistics,** Amer. Math. Month., Vol. 

17(1910), pp. 235-40. 

9. “On the metabolism experiment as a statistical problem,** Rietz and Mitchell. Jour. 

of Biol. Chem., 1910. ‘ 

]0. On the Measurement of Correlation with Special Reference to Some Characters of Indian 
Corn, Henry L. Rietz and Louis H. Smith, Bulletin No. 148, University of Illinois 
Agricultural Experiment Station, November 1910. 

11. “On the construction and graduation of a rural life table,** Rietz, H. L. and Forsyth, 

C. H., Record Amer. Inst, of Actuar., Vol. 1(1911), pp. 9-19. 

12. “On the theory of correlation with special reference to certain significant loci on the 

plane of distribution in the case of normal correlation,** Annals of Math,, Vol. 
13(1911?), pp. 187-199. 

13. “Note on the definition of an as 3 'mptote,** Amer. Math. Month., Vol. 19 (1912), pp. 89-90. 

14. “The determination of the relative volumes of the components of rocks by mensuration 

methods,** Lincoln and Rietz. Economic Geology, Vol. VIII, No. 2, March, 1913. 

15. “On the mathematical theory of risk and Landre*s theory of the maximum,** Record 

Amer. Inst. Actuar., Vol. 11(1913), pp. 1-14. 

16. “On the status of certain current pension funds,** Record Amer. Inst. Actuar., Vol. 

3(1914), pp. 33-53. 

17. “Group Insurance,** Record Amer. Inst. Actuar., Vol. 3(1914), pp. 277-79. 

18. “Degrees of resemblance of parents and offspring with respect to birth as twins for 

registered Shropshire sheep.** (Rietz, H. L. and Roberts, Elmer.) Jour.Agric. 
Res., Vol. 4, No. 6, pp. 479-510. 

19. “Note on double interpolation by finite differences,** Record Amer. Inst, Actuar., Vol, 

4(1915), pp. 15-22. 

20. “On the correlation of marks in mathematics and law,** Jour. Educ. Psych., Vol. 7, No. 

2, pp. 87-92. 

21. “The operation of pension laws in foreign countries,** Report of Illinois Pension Laws 

Commission, 1916, pp. 19-35. 

22. “Actuarial Report on Pension Funds for Public Employees of Illinois,** H. L. Rietz, 

and D. F. Campbell, Report of Illinois Pension Laws Commission, 1916, pp. 
72-199. 

23. “The underlying principles of a pension plan,** H. L. Rietz, G. E. Hooker, and D. F. 

Campbell, Report of Illinois Pension Laws Commission, 1916, pp. 272-284. 

24. “Report of Illinois Pension Laws Commission,** Rietz and others, 1916, pp. 310, 
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25. the value of certain proposed refunds payable at the death of an annuitant under a 

pension system,** Record Amer, Imt, Actuar., Vol. 6(1917), pp. 62-75. 

26. ‘‘A statistical study of some indirect effects of certain selections in the breeding of 

Indian corn,** Rietz, H. L. and Smith, L. H., Jour, of Agric. Res., Vol. 11(1917), 
pp. 105-46. 

27. “Methods of providing for expenses of new business by life insurance companies,** 

Amer, Econ, Rev.y Vol. 7(1917), pp. 832-38. 

28. “A Report to the Trustees of the Carnegie Foundation by the Commission Chosen to 

study and report upon the Proposed Plan of Insurance and Annuities,** Rietz 
and others, 19 pp. 

29. “Pensions for public employees,** Amcr. Polit. Sci. Review, Vol. 12(1918), pp. 265-68. 

30. “Statistical methods for preparation for war department service,** Amer. Math. Month., 

Vol. 26(1919), pp. 99-100. 

31. “Recent developments in pension plans for public employees,** Record Amer. Inst. 

Actuar., Vol. 8(1919), pp. 1-12. 

32. “Scope and advantages of courses of instruction on life insurance in American colleges 

and universities,** Record Amer. Inst. Aciuar., Vol. 8(1919), pp. 202-06. 

33. “The effect of present inflated prices on the future interest rate,** Record Amer. Inst. 

Actuar., Vol. 8(1919), pp. 308-14. 

34. “On functional relations for which the coefficient of correlation is zero,.** Quart. Pub. 

Amer. Siat. Assn. September 1919, pp. 472-76. 

35. “Exposition of the main provisions of the standard plan for a combined comprehensive 

annuity and insurance system for public employees,** Illinois Pension Laws Com¬ 
mission Report, 1919, pp. 25-48. 

36. “Recent developments in pension legislation in other states of the United States,** 

Illinois Pension Laws Commission Report, 1919, pp. 201-06. 

37. “The world’s experience in the operation of public service pension systems,’* Illinois 

Pension Commission Report, 1919, p. 207-14. 

38. “Summary of the report of the Illinois Pension Laws Commission of 1916,*’ Illinois 

Pension Commission Report, 1919, pp. 215-22. 

39. “Effects of the pension legislation by the fiftieth general assembly of Illinois,” Illinois 

Pension Commission Report, 1919, pp. 223-34. 

40. “Illinois state teachers* pension and retirement system. Illinois Pension Commission 

Report,** 1919, pp. 235-38. 

41. “Industrial and institutional pension systems,” Illinois Pension Commission Report, 

1919, pp. 239-50. 

42. “Urn schemata as a basis for the development of correlation theory,” Annals of Math., 

Vol. 21(1920), pp. 306-22. 

43. “On certain properties of Makeham’s laws of mortality,” Amer. Math. Month., Vol. 

27(1921), pp. 152-65. 

44. “Pension systems for insurance, company employees,” Record Amer. Inst. Acinar., 

Vol. 10(1921), pp. 1-14. 

45. “An elementary exposition of the theorem of Bernoulli with applications to statistics,” 

Math. Teacher, Vol. 14(1921), pp. 427-34. 

46. “Frequency distributions obtained by certain transformations of normally distributed 

variates,” Annals Math., Vol. 22(1922), pp. 292-300. 

47. ‘“Statistics* in a Mathematical Encyclopedic Dictionar}",” Amer. Math. Month., Vol. 

29(1922), pp. 333-337. 

48. “On the subject matter of a course in mathematical statistics,” Presented before the 

Mathematical Association of America at the Symposium held in Cambridge, Dec. 
29, 1922, Amer. Math. Month., Vol. 30(1923), pp. 156-166. 

49. “On certain topics in the mathematical theory of statistics, ” symposium Lectures before 

American Mathematical Society, Bull. Amer. Math. Soc., Vol. 30(1924), pp. 
417-453. 
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60. ^'On anauity rates/' Record Amer. Inet. ActUar., Vol. 13(1924), pp. 120-122. 

61. certain applications of mathematical statistics to actuarial data," Record 4«^* 

InBt. Aciuar., Vol. 13(1924), pp. 214-260. 

62. "On a certain law of probability of Laplace," Proc, Internal, Math. Cang.f Toronto, 

Vol. 2(1924), pp. 795-799. 

63. "Note on average numbers of brothers and of sisters of the boys in families of N chil¬ 

dren," Science, Vol. 60(1924), pp. 46-47. 

54. "On the representation of a certain fundamental law of probability," Trane. Avner. 
Math. Soc., Vol. 27(1926), pp. 197-212. 

66. "On applications of statistical methods in actuarial science," Record Amer. Inst. 
Actuar., Vol. 14(1925), pp. 102-04. 

56. "Mathematical background for the study of statistics," H. L. Rietz and A. R. Cra- 

thorne. Jour. Amer. Slat. Assn., Vol. 21(1926), pp. 435-440. 

57. "On certain applications of the differential and integral calculus in actuarial science," 

Amer. Math. Month., Vol. 33(1926), pp. 9-23. 

58. "Note on the most probable number of deaths," Record Amer. Inst. Actuar., Vol. 

16(1927), pp, 26-29. 

59. "On certain properties of frequency distributions of the powers and roots of the variates 

of a given distribution," Proc. Nat. Acad. Sci., Vol. 13(1927), pp. 817-20. 

60. Discussion of "Interpolation with modified coefficients," Record Amer. Inst. Actuar., 

Vol. 16(1927), pp. 232-33. 

61. "On the risk problem from a mathematical point of view," Neuvienne Congres Inter-- 

national D^Actuaires, Rapports, Tome II, (1930), pp. 294-306. 

62. "Pensions for superannuated employees," Retiring vice-presidential address before 

Section K. AAAS, Sci. Month., March 1930, pp. 224-30. 

63. "On certain properties of frequency distributions obtained by a linear fractional trans¬ 

formation of the variates of a given distribution," Annals of Math. Stat., Vol. 
2(1931), pp. 38-47. 

64. "Note on the distribution of the standard deviation of sets of thfee variates drawn at 

random from a rectangular distribution," Biomeirika, Vol. 23(1931), pp. 424-26. 

65. "Some remarks on mathematical statistics," Retiring president’s address before the 

Iowa Academy of Science, Science, Vol. 74(1931), pp. 1-4. 

66. "Comments on applications of recently developed theory of small samples," «/our. Amer. 

Stat. Assn., Vol, 26(1931), pp. 37-44. 

67. "A simple non-normal correlation surface," Biometrika, Vol. 24(1932), pp. 288-90. 

68. "On the Lexis theory and the analysis of variance," Bull. Amer. Math. Soc.,\o\. 

27(1932), pp. 731-35. 

69. "Unemployment and social insurance," Record Arner. Inst. Actuar., Vol. 23(1934), pp. 

147-52. 

70. "On the frequency distribution of certain ratios," Annals of Math. Stat., Vol. 7(1936), 

pp. 145-53. 

71. "Some topics in sampling theory,” Bull. Amer. Math. Soc., Vol. 43(1937), pp. 209-30. 

72. "Collective insurance," Bull. Amer. Assn. Univ. Prof., Vol. 23(1937), pp. 278-81. 

73. "On the distribution of the ‘Student’ ratio for small samples from certain non-normal 

distributions," Afinals Math. Stat., Vol. 10(1939), pp. 265-74. 

74. "On a recent advance in statistical inference," Amer. Math. Month., Vol. 45(1938), 

pp. 149-58. 

Doctorate Dissertations Written Under the Supervision of Professor Rietz 

Reilly, John Franklin, 1921. On certain generalizations of osculatory interpolation. 

Weida, Frank M., 1923. The valuation of life annuities with refund of an arbitrarily as¬ 
signed part of the purchase price. 

Smith, Clarence De Witt, 1928. On generalized Tchebycheff inequalities in mathematical 
statistics. 
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Meyery Herbert A., 1929. On certain inequalities with applications in actuarial science, 
Craigy Allen Thornton^ 1931. On the distribution of certain statistics derived from Bmkll 
random samples. 

WilkSy Samuel Stanleyy 1931. On the distribution of statistics in samples from a normal 
population of two variables with matched sampling of one variable. 

Fischery Carl 1932. On correlation surfaces of sums with a certain number of random 
elements in common. 

Harper^ Floyd S.y 1935. An actuarial study of infant mortality. 

OlliveTy Arthur^ 1935. On certain mathematical developments underlying an analysis of 
general death rates. 

Knowlery Lloyd A.y 1937. Actuarial aspects of recent old age security legislation. 

Olsheny Abraham C., 1937. Transformations of the Pearson type III distribution. 

Bergy William D., 1941. Theorems on certain type A difference equation graduations. 
SatterthwaitCy Frankliny 1941. Developments on the theory of Chi-square. 

Garfiny Louisy 1942. A comparative study of the underlying principles of certain pension 
schemes for a staff of employees with special reference to teachers and public 
employees. 

The last two dissertations were under the joint supervision of Professor Rietz and Pro¬ 
fessor A. T. Craig. 

Books 

1. College AlgehrOy H. L. Rietz and A. R. Crathoftie, First edition, 1909, Fourth edition, 

1939. Henry Holt and Company, New York. 

2. School AlgebrOy two volumes. H. L. Rietz, A. R. Crathorne, E. H. Taylor, 1915. Henry 

Holt and Company. 

3. Mathematics of FinancCy H. L. Rietz, A. R. Crathorne, J. C. Rietz. First edition 1921, 

second edition 1929. Henry Holt and Company. 

4. Introductory College Algebra. H. L. Rietz and A. R. Crathorne. First edition 1923, 

second edition 1933. Henry Holt and Company. 

5. Handbook of Mathematical Statisticsy H. L. Rietz, Editor-in-chief, 1924, Houghton 

Mifflin Company. 

6. Mathematical Statistics. Third Cams Mathematical Monograph, published for the 

Mathematical Association of America by the Open Court Publishing Co. 1927. 

7. Review of Pre-college MathematieSy C. J. Lapp, F. B. Knight, H. L. Rietz, 1934. Scott, 

Foresman and Company. 

8. Plane Trigonometryy H. L. Rietz, J. F. Reilly, Roscoe Woods, 1935. The Macmillan 

Company. 

9. Plane and Spherical Trigonometry. H. L. Rietz, J. F. Reilly, Roscoe Woods, 1936. 

The Macmillan Company. 

10. Intermediate Algebra, H. L. Rietz, A. R. Crathorne, L. J. Adams, 1942. Henry Holt 

and Company. 

11. Review of Mathematics for College Students. C. J. Lapp, F. B. Knight, H. L. Rietz, 

1942. Foresman and Company. 



HEWS AND NOTICES 

Readers are invited to submit to the Secretary 'of the Institute 
news items of general interest 

Personal Items 

Lt. Col. Joseph Berkson is now stationed at Headquarters, Anny Air Fwjes, 
Air Surgeon’s Office, Washington 25, D. C. 

Dr. Ernest E. Blanche is now Statistical Director, Office of the Director of 
Engineering, with the Curtiss-Wright Corporation at Buffalo. 

Dr. Alva E. Brandt is overseas serving as Operation^; Analyst for the Army 
Air Forces. 

Professor W. G. C'ochran and Dr. A. M. Mood are serving as Research Mathe¬ 
maticians on a war research project at Princeton University. Professor Cochran 
is on leave of absence from Iowa State College; Dr. Mood from the University 
of Texas. 

Mr. Rol>ert Dorfman is overseas serving as Operations ^Vnalyst with the 13th 
U. S. Air Force. 

Mr. R. M. Foster of Bell Telephone Laboratories has been appointed professor 
and head of the department of mathematics of the Polytechnic Institute of 
Brooklyn. 

Dr. Andrew I. Peterson is now Director of Manufacturing Research at the 
Victor Division of the Radio Corporation of America, Camden, N. J. 

Dr. Edward Helly, visiting lecturer at the Illinois Institute of Technology, died 
on November 28, 1943. 

* Professor Henry L. Rietz died on December 7, 1943 after a long period of 
illness. An account of Professor Rietz’ scientific life and achievements by 
Professor A. R. Crathome appears on pp. 102-108 of the present issue of the 
Annals, 


Hew Members 

The following persons have been elected to membership in the Institute: 

Allen, Roy George Douglas. D.Sc. (London). Reader in Economic Statistics, University 
of London. Apt. 219, 2745 29tli St. NW, Washington 8, D. C. 

Burk, Mrs. Marjorie F. A.B. (Hunter) Assoc. Statistician, Meteorology, HQ, AAF. 
1912 Third St. NE, Washington 2, D. C. 

Churchman, C. West. Ph.D (Pennsylvania) Research Statistician, Frankford Arsenal 
and Lecturer in Philosophy, Univ. of Pennsylvania. Frankford Arsenal, Philadelphia, 
Pa. 

Crump, S. Lee. B.S. (Cornell) Research Associate, Iowa State College. Statistical Lab , 
Iowa State College, Ames, Iowa. 

Divatia, M. V. M.A. (Columbia) Statistical Officer, Dept, of Industries and Civil Sup¬ 
plies, New Delhi, India. 

Freund, John E. . B.A. (U.C.L.A.) Box 4221 Westwood Village Station, Los Angeles 24, 
Calif. 
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OUl, John P, Statistician, Dopt. of Research and Statistics, Federal Reserve Bank, 
Dallas, Texas. 

Lindsey, Fred D. M.A. (George Washington Univ.)- 628 West 114 St., New York, N. Y. 

Maloney, Clifford J. M.A. (Minnesota) 2d Lt., Sig. C. 1535 18th St. N., Arlington, Va. 

Mandel, John. Licence en Sciences (Univ. of Brussels) 45 Kew Gardens Rd., Kew 
Gardens, N. Y. 

Mathieus, George John. B.L. (Univ. of Dayton) Shop followup—Assembly Planning, 
Douglas, Long Beach. 1853 Poppy St., N. Long Beach, Calif. 

Mclnt3rre, Francis E. Ph.D. (Chicago) Program Officer, Foreign Economic Administra¬ 
tion. Rm. 2446 Temporary U Bldg., Washington 25, D. C. 

Raybouid, Ethel H. M.A. (Queensland) Lecturer in Mathematics. The University 
of Queensland, Brisbane, Australia. 

Rosenblatt, Alfred. Ph.D. (Cracow) Catedratico de la Universidad de San Marcos, 
Lima. Calle Atahuaipa 192, Miraflores, Peru. 

Saunders, Robert J. B.8. (Mass. Inst, of Tech.) Captain, Ordnance Dept., Inspection 
Sec., Amm^n Branch, Office Chief of Ordnance. Office Field Director Ammunition 
Plants, 3637 Lindell Blvd., St. Louis 8, Missouri. 

Schwartz, David H. B.S. (C.C.N.Y.) Assoc. Statistician, Office of the Quartermaster 
General. 338 N. Geo. Mason Dr., Arlington, Va. 

Solomons, Leonard M. B.A. (Columbia) Time Study Man. 150-11 88 Ave., Jamaica 2, 
N. Y. 

Steinberg, Joseph. B.S. (C.C.N.Y.) Associate Statistician, Bureau of Research and 
Statistics, Social Security Board. 5041 North Capitol Street, Washington 11, D. C. 

Tomlinson, Malcolm C. W. 3820 Southern Ave., S.E., Washington, D. C. 

Wilson, Edward F. Asst, Engineer, Special Projects. Bldg. 650, Research Center, Aber¬ 
deen Proving Ground, Md. 


Announcements 

Washington Meeting of the Institute 

There will be a joint sectional meeting* of the Institute of Mathematical 
Statistics and the American Statistical Association at the Hotel Statler and 
George Washington University in Washington, D. C., on Saturday and Sunday, 
May 6-7, 1944. 

On Saturday afternoon there will be a session on the Theory of Statistical Infer¬ 
ence with Professor A. Wald and Lt. J. H. Curtiss as the speakers. On Sunday 
morning there mil be a session on contributed papers and on Sunday after¬ 
noon a session will l)e devoted to Time Series mth Profe.ssor J. L. Doob and Dr. 
T. Koopmans as the speakers. 

Summer Meeting of the Institute 

The summer meeting of the Institute will be hold in conjunction with the 
summer meetings of the American Mathematical Society and the Mathematical 
iVssociation of America, at Wellesley Cyollege, Wellesley, Massachusetts, on 
August 12-14, 1944. Abstracts of contributed papers for this meeting should 
be sent (in duplicate) to the Secretary of the Institute before July 1, 1944. 

* The Washington meeting of the Institute had been announced in the December, 1943 
issue of the Annals for April 27-28,1944, but the date has been revised to May 6-7, in order 
that the meeting may be held jointly with the Association. 
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ANWtAL REPORT OF THE PRESm^T OF THE DfSTmiTE 

During 1943, the second war year in which a regular annual meeting <rf tJie 
Institute was not held, the bu^ess of the Institute was necessarily largely con¬ 
ducted by mail. As reported in the March issue of the Annate, the Secretary, 
the Editor of the Annals, and I met in Pittsburgh in January for discussion of the 
Institute’s affairs. Since three members of the Board of Directors do not consti¬ 
tute a quorum, certain proposals arising from this informal meeting were sub¬ 
mitted by mail to the whole Board for action. At the fall meeting of the Institute 
held m New Brunswick a quorum of the Board was present, Hotelling, Wald, 
Wilks, Craig, and action was taken on certain matters. The chief subject for 
consideration was future meetings of the Institute. It was agreed that again in 
accordance \vith the request of the O.D.T. no annual meeting should be planned 
for 1943. Because of the success of local meetings held in New York City in 
May and in Washington in June, it was voted to repeat these in 1944. The New 
Brunswick meeting was also very successful, and it is hoped that we may again 
hold our fall meeting in conjunction with those of the American Mathematical 
Society and the Mathematical Association of America in 1944. The desirability 
of other local meetings in addition to those in New York and Washington was 
recognized but, so far, I know of no plans for any in 1944. 

During the year two local chapters of the Institute were organized in Pittsburgh 
and Washington in accordance with regulations adopted by the Board of Direc¬ 
tors, and these were recognized by the Board. The sponsor for the Pittsburgh 
group is E. G. Olds, and W. G. Madow has the same responsibility for the 
Washington chapter. 

The Membership Committee for 1943 consisted of Vice-President Deming, 
(Chairman, W. G. (vochran, P. S. Dwyer, and A. J. Lotka. As the result of their 
recommendation the following eighteen new Fellows of the Institute were elected 
by the Board of Directors: A. H. (Xipeland, J, H. Curtiss, J. F. Daly, C. E. Dieulc- 
fait, H. F. Dodge, Churchill Eisenhart, Will Feller, Milton Friedman, M. A. 
Girschick, M. H. Hansen, P. G. Hoel, Tjalling Koopmans, A. M. Mood, L. J. 
Reed, L. E. Simon, F. F. Stephan, W. R. Thompson, and Jacob Wolfowitz. 

W. D. Baten, L. A. Aroian, I. W. Burr, and 11. F. Dodge, at the lequest of the 
Board, continued to serve as a committee for securing additional library subscrip¬ 
tions to the Annals. Programs for the New York and New Brunswick meetings 
were in charge of Vice-President Wald, and that for the Washington meeting was 
arranged by W. G. Madow. 

A proposal originating with Harold Hotelling that the Institute consider peti¬ 
tioning the Federal Government that the W. P. A. Computing Project be made a 
permanent computing group for the service of scientific research in the constme- 
tion of important numerical tables, was referred to a committee consisting of 
A. R. Cmthorne, C^hairman, P. S. Dwyer, and Will Feller. As a result of their 
report, the Board appointed the following permanent committee on tabular 
computation, P. S. Dwyer, C'hairman, Churchill Eisenhart, and Will Feller. It 



112 


BEPOBT OF PRESIDENT 


is expected that this committee will cooperate with representatives of other 
scientific organizations interested in tabular computation. 

The Nominating Committee for the recently held election of the Institute was 
A. T. Craig, Chairman, B. H. Camp, and J. H. Curtiss. G. W. Snedecor served 
the Institute as its representative on the Council of the American Association for 
the Advancement of Science. 

To all of those mentioned the Institute is indebted, and for the Board of Direc¬ 
tors I wish to thank them for their special contributions to the Institute. 

I have reserved for particular mention the services of Vice-President Deming 
who was appointed by the Board the official representative ot the Institute to deal 
with officials of the Selective Service Board in Washington relative to the defer¬ 
ment from military service of competent and experienced statisticians engaged 
in work important for the prosecution of the war. He gave much time and effort 
to this on his o^vn initiative, and the Institute and all statisticians owe him much 
in that he has been very successful in convincing important Washington officials 
of the value of the services being rendered by good statisticians. 

In December the Board with deep regret accepted the resignation of E. G. Olds 
who was completing his third year as Secretary of the Institute. No member of 
the Institute will need to l)e reminded of the devotion and efficiency with which 
he filled his office. A mere comparative inspection of his annual reports would 
reveal to anyone otherwise iminformed how much the Institute owes to him. 
During his term of office the membership of the Institute has approximately 
doubled and the financial position of the Institute has been markedly improved 
in spite of war conditions. For both of these favorable circumstances he de¬ 
serves the greater part of the credit. In his present position as Chief Statistical 
Consultant with the Office of Production Research and Development of the 
War Production Board in association with Holbrook Working, he is still serv'- 
ing the cause of statistics as well as the war effort. The Institute will greatly 
benefit by the intensive work these men are doing in educating industry in the 
uses of statistical methods in the control of quality in production. 

The annual election of the Institute just concluded by mail resulted in the elec¬ 
tion of the following officers for 1944: W. A. Shewhart, President; W. G. Cochran 
and Will Feller, Vice-Presidents; and P. S. Dwyer, Secretary-Treasurer. Pro¬ 
fessor Dwyer had been appointed Acting Secretary-Treasurer upon the resigna¬ 
tion of Professor Olds. I need not remind the members of the Institute that the 
year 1944 remains critical for the Institute and that these new officers will need 
the fullest support. 

C. C. Craig, 

PresidefU^ 1943. 


February 16, 1944. 
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ANimAL REPORT OF THE SSCRETARY-TREASORER OP TBE 

OrSnTDTE 

During 1943 three meetings of the Institute were held. On May 29, the Insti¬ 
tute met jointly with the American Society of Mechtmical E^ipheers at the 
Engineering Societies Building in New York City, the program having bera 
arranged by Abraham Wald and A. I. Peterson. A meeting consisting of three 
evening sessions was held on June 17-19, at George Washington Univeraty, 
Washington, D. C. William G. Madow made the necessary preparations for this 
meeting. On September 12-13, the Institute held its sixth summer meeting at 
New Jersey College for Women, Rutgers Univeraty, New Brunswick, New 
Jersey. This meeting was held in conjunction with the summer meetings of the 
American Mathematical Society and the Mathematical Association of America, 
and the program was arranged by Abraham Wald. All three of these meetings 
were well attended. 

Early in the year a petition was granted for the establishment of a Pittsburgh 
Chapter of the Institute. This organization, formerly known as the Society 
of Quality Control Statisticians, held its first meeting under the auspices of the 
Institute 6n June 19, and a second meeting was held on October 9. 

As recorded in an addendum to last year’s report, Vice-President E. L. Dodd 
died on January 9, 1943, and Dr. W. E. Doming was appointed to fill out the 
unexpired term. 

Professor H. L. Rietz died on December 7,1943. A statement of appreciation 
for his work in mathematical statistics and in connection with the Institute has 
been prepared by Professor A. R. Crathome and appears in the present issue 
of the Annals. 

The following financial statement covers a period from December 10, 1942, to 
December 21, 1943 (the books and records of the Treasurer have been audited 
by Paul S. Dwyer and found to be in agreement with the statement as sub¬ 
mitted) : 


FINANCIAL STATEMENT 
December 10, 1942, to December 21, 1943 
Receifts 


Balance on Hand, December 10, 1942. $2,136.13 

Dues. 2,640.62 

SUBSCWPTIONB . 1,631.67 

Sales OF Back N umbebs . 901.47 

Miscellaneous . 6.07 


Total Receipts 


17,334.96 
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Expenditures 

Annajls Office. 

Waverly Press 

Printing and Mailing Annals —4 issues. 

Back Numbers Office 

Purchase of back numbers from H. C. Carver. $171.66 

Reprinting 200 copies of Vol. VII, No. 1. 108.08 

279.64 

. 36.11 

$136.00 
2.26 
136.77 
171.20 


444.22 

Programs for Meetings . 44.22 

Board, of Directors . 41.74 

Miscellaneous . 5.45 

Total Expenditures. $3,619.91 

Balance on Hand, December 21, 1943 . 3,715.05 


$7,334.96 

In comparison with the financial condition of the Institute at the end of 1942, 
the receipts from dues and subscriptions have increased nearly $700 and the 
proceeds from sales of back numbers have decreased nearly $500. In spite of 
increased prices, it was possible to reduce expenditures by approximately $800. 
Thus the Institute finds itself in a somewhat more favorable position than at the 
end of last year. 

Edwin C. Olds, 
Secretary-Treasurer. 

December 27, 1943. 


Library Commii’tbe. 

Secretary-Treasurer's Office 

. Printing and Supplies. 

Binding. 

Postage. 

Clerical Help. 


$5.25 

2,763.28 


CONSTITUTION 

OF THE 

INSTITUTE OF MATHEMATICAL STATISTICS 

ARTICLE I 
Name and Purpose 

1. This organization shall be known as the Institute of Mathematical Statistics. 

2. Its object shall be to promote the interests of mathematical statistics. 

ARTICLE II 
Membership 

1. The membership of the Institute shall consist of Members, Junior Members, Fellows, 
Honora^ Members, and Sustaining Members. 
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2. Voting members of the Institute shall be <a) the Fellows, and (b) all oihers, Junior 

Members excepted, who have been members fw twenty-three months prior to the date of 
voting, ‘ 

3, No person shall be a Junior Member of the Institute for more than a limited term as 
determined by the Committee on Membership and approved by the Board of Directors. 

ARTICLE III 

Officers, Board of Directors, and Committee on Membership 

1. The Officers of the Institute shall be a President, two Vice-Presidents, and a Secre¬ 
tary-Treasurer. The terms of office of the President and Vice-Presidents shall be one year 
and that of the Secretary-Treasurer three years. Elections shall be by majority ballots at 
Annual Meetings of the Institute. Voting may be in person or by mail. 

(a) Exception, The first group of Officers shall be elected by a majority vote of the in¬ 
dividuals present at the organization meeting, and shall serve until December 31, 1936. 

2. The Board of Directors of the Institute shall consist of the Officers, the two previous 
Presidents, and the Editor of the Official Journal of the Institute. 

3. The Institute shall have a Committee on Membership composed of three Fellows. 
At their first meeting subsequent to the adoption of this Constitution, the Board of Di¬ 
rectors shall elect three members as Fellows to serve as the Committee on Membership, 
one member of the Committee for a term of one year, another for a term of two years, 
and another for a term of three years. Thereafter the Board of Directors shall elect from 
among the Fellows one member annually at their first meeting after their election for a 
term of three years. The president shall designate one of the Vice-Presidents as Chairman 
of this Committee. 


ARTICLE IV 
Meetings 

1. A meeting for the presentation and discussion of paj)ers, for the election of Officers, 
and for the transaction of other lousiness of the Institute shall be held annually at such 
time as the Board of Directors may designate. Additional meetings may be called from 
time to time by the Board of Directors and shall be called at any time by the President 
upon written request from ten Fellows. Notice of the time and place of meeting shall be 
given to the meml)ership by the Secretary-Treasurer at least thirty days prior to the date 
set for the meeting. All meetings except executive sessions shall be open to the public. 
Only papers accepted by a Program Committee apiwinted by the President may be pre¬ 
sented to the Institute. 

2. The Board of Directors shall hold a meeting immediately after their election and 
again immediately before the expiration of their term. Other meetings of the Board may 
be held from time to time at the call of the President or any two members of the Board. 
Notice of each meeting of the Board, other than the two regular meetings, together with a 
statement of the business to be brought before the meeting, must be given to the members 
of the Board by the Secretary-Treasurer at least five days prior to the date set therefor. 
Should other business be passed upon, any member of the Board shall have the right to 
reopen the question at the next meeting. 
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3. The Committee on Membership shall hold a meeting immediately after the annual 
meeting of the Institute. Further meetings of the Committee may be held from time to 
time at the call of the Chairman or any member of the Committee provided notice of such 
call and the purpose of the meeting is given to the members of the Committee by the 
Secretary-Treasurer at least five days before the date set therefor. Should other business 
be passed upon, any member of the Committee shall have the right to reopen the question 
at the next meeting. 

4. At a regularly convened meeting of the Board of Directors, four members shall con¬ 
stitute a quorum. At a regularly convened meeting of the Committee on Membership, 
two members shall constitute a quorum. 

ARTICLE V 

PUBUCATIONS 

1. The Annals of Mathematical Statistics shall be the Official Journal for the Institute. 
The Editor of the Annals of Mathematical Statistics shall be a Fellow appointed by the 
Board of Directors of the Institute. The term of office of the Editor may be terminated at 
the discretion of the Board of Directors. 

2. Other publications may be originated by the Board of Directors as occasion arises. 

ARTICLE VI 
Expulsion or Suspension 

1. Except for non-payment of dues, no one shall be expelled or suspended except by 
action of the Board of Directors with not more than one negative vote. 

ARTICLE VII 

Amendments 

1. This constitution may be amended by an affirmative two-thirds vote at any regularly 
convened meeting of the Institute provided notice of such projK^sed amendment shall have 
been sent to each voting member by the Secretary-Treasurer at least thirty days before the 
date of the meeting at which the proposal is to be acted upon. Voting may be in person or 
by mail. 


BY-LAWS 

ARTICLE I 

Duties of the Officers, the Editor, Board of Directors, and Committee on Mem¬ 
bership 

1. The President, or in his absence, one of the Vice-Presidents, or in the absence of the 
President and both Vice-Presidents, a Fellow selected by vote of the Fellows present, shall 
preside at the meetings of the Institute and of the Board of Directors. At meetings of the 
Institute, the presiding officer shall vote only in the case of a tie, but at meetings of the 
Board of Directors he may vote in all cases. At least three months before the date of the 
annual meeting, the President shall appoint a Nominating Committee of three members. 
It shall be the duty of the Nominating Committee to make nominations for Officers to be 
elected at the annual meeting and the Secretary-Treasurer shall notify all voting members 
at least thirty days before the annual meeting. Additional nominations may be sub- 
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mitted in writing, if signed by at least ten Fellows of the Institute, up to the time of the 
meeting. 

2. The Secretary-Treasurer shall keep a full and accurate record of the proceedings at 
the meetings of the Institute and of the !l^ard of Directors, send out calls for said meetings 
and, with the approval of the President and the Board, carry on the corieepondence of the 
Institute. Subject to the direction of the Board, he shall have charge of the archives and 
other tangible and intangible property of the Institute, and once a year he shall puUish in 
the Annals of MalhenOiHcal Statistics a classified list of all Members and Fellows of the 
Institute. He shall send out calls for annual dues and acknowledge receipt of same; pay 
all bills approved by the President for expenditures authorised by the Board or the Insti¬ 
tute ; keep a detailed account of all receipts and expenditures, prepare a financial statement 
at the end of each year and present an abstract of the same at the annual meeting of the 
Institute after it has been audited by a Member or Fellow of the Institute appointed by the 
President as Auditor. The Auditor shall report to the President. 

3. Subject to the direction of the Board, the Editor shall be charged with the responsi¬ 
bility for all editorial matters concerning the editing of the Annals of Ma^kmaHcal Sta¬ 
tistics. He shall, with the advice and consent of the Board, appoint an Editorial Commit¬ 
tee of not less than twelve members to co-operate with him; four for a period of five years, 
four for a period of three years, and the remaining members for a period of two years, ap¬ 
pointments to be made annually as needed. All appointments to the Editorial Com¬ 
mittee shall terminate with the appointment of a new Editor. The Editor shall serve as 
editorial adviser in the publication of all scientific monographs and pamphlets authorized 
by the Board. 

4. The Board of Directors shall have charge of the funds and of the affairs of the In¬ 
stitute, with the exception of those affairs specifically assigned to the President or to the 
Committee on Membership, The Board shall have authority to fill all vacancies ad in¬ 
terim, occurring among the Officers, Board of Directors, or in any of the Committees. The 
Board may appoint such other committees as may be required from time to time to carry 
on the affairs of the Institute. 

5. The Compiittee on Membership shall prepare and make available through the Secre¬ 
tary-Treasurer an announcement indicating the qualifications requisite for the different 
grades of membership. 

ARTICLE II 
Dues 

Members shall pay five dollars at the time of admission to membership and shall receive 
the full current volume of the Official Journal. Thereafter, Members shall pay five dol¬ 
lars annual dues. The annual dues of Junior Members shall be two dollars and fifty cents. 

The annual dues of Fellows shall be five dollars. The annual dues of Sustaining Members 
shall be fifty dollars. Honorary Members shall be exempt from all dues. 

(a) Exception. In the case that two Members of the Institute are husband and wife 
and they elect to receive between them only one copy of the Official Journal, the annual 
dues of each shall be three dqllars and seventy-five cents. 

2. Annual dues shall be payable on the first day of January of each year. 

3. The annual dues of a Fellow, Member, or Junior Member include a subscription to the 
Official Journal. The annual dues of a Sustaining Member include two subscriptions to 
the Official Journal. 

4. It shall be the duty of the Secretary-Treasurer to notify by mail anyone whose dues 
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may be six months in arrears, and to accompany such notice by a copy of this Article. If 
such person fail to pay such dues within three months from the date of mailing such notice, 
the Secretary-Treasurer shall report the delinquent one to the Board of Directors, by whom 
the person’s name may be stricken from the rolls and all privileges of membership with¬ 
drawn, Such person may, however, be re-instated by the Board of Directors upon pay¬ 
ment of the arrears of dues. 


ARTICLE III 
Salaries 

1. The Institute shall not pay a salary to any Officer, Director, or member of any com¬ 
mittee. 


ARTICLE IV 
Amendments 

1. These By-Laws may be amended in the same manner as the Constitution or by a 
majority vote at any regularly convened meeting of the Institute, if the proposed amend¬ 
ment has been previously approved by the Board of Directors. 



FURTHER COHTRIBUTIONS TO THE PROBLEM OF SERIAL 

CORRELATION 

Bt WlLFMD J. DkON 
Princeton Unwenity 

1. latrodaction. Recently, there has been an increasing interest in the study 
of the serial correlation of observations. The development of the distribution 
theory and significance criteria was retarded by the fact that the successive dif¬ 
ferences or successive products of statistical variates are not independent. How¬ 
ever, these difficulties have been overcome to a considerable extent by recent work 
of several authors. In order to indicate the nature of the contributions embodied 
in the present paper, it will be necessary to describe rather precisely the contri¬ 
butions of these authors. 

Suppose Xi, Xi, • • • , a:„ are n independent observations of a random variable 
X which is normally distributed with mean a and variance <r*. Let us define 

5n-l = (*•+> ~ *<)* 5n = 23 (*<+i ~ 

n—1 n 

(1.1) C „^1 = (Xi - x)(xi+t - x) C„ = X) (*< - a:)(a;,+i - x) 

t-1 »->! 


lC„ = S (x< - x)(xu.L - x) F„ = X) - »)* 

t-1 

in which Xn^i = Xi . The ratio of any of the first five values to Vn will be a 
measure of the relation between the successive observations Xi . 

Von Neumann [2] has studied the ratio rj = d\^i/Vn . He obtains an expres¬ 
sion for the sampling distribution of the ratio r?. He solves the equivalent prob- 

n~l 

lem of determining the distribution of 21 where the point (j/i, 1 / 2 , • • • , j/n-i) 

n~l 

is uniformly distributed over the spherical surface 2Z = 1 and l-h® are the 

>-i 

m 

characteristic values of 6n-i . He obtains the distribution o)(y) of 7 = X) 

i-l 

(w even) where the point (xi, 0 : 2 , • • • , Xn) is uniformly distributed over the 

m 

spherical surface ^ x? = 1 and Bi> Bi> • • • > Bm. < 0 ( 7 ) is found by solving 
the equation 

(1.2) r (7 - *)-*"«(7) d7 = n (5* - *)■*. 

Jb^ i-i 


The distribution of 1 ) is then a special case of this distribution. The first four 
moments were obtained by Williams [5] by the use of a g«ierating function. In 
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the present paper we shall study the ratio S^/Vn . The moments of this ratio 
will be devdoped and the moments and approximate distribution of [2 — • 

Von Neumann [4] in a paper which removed a restriction (that m be even) 
on the distribution of n indicates how to determine the distribution of Cn-i/Vn • 
Koopmans [9] considers the stodiastical process Xt — pXt-i + (< => 1, 2, • • •). 

I p I < 1. The zi are independent drawings from a normal distribution with 
zero mean and variance v*. To test the hypothesis that p = 0 he shows that it 
is sufficient to know the distribution of C«_i/V». He finds the distribution of 
C„-.ifVn and Cn/Vn but finds that the numerical computation of these functions 
is very cumbersome. This prompts him to obtain approximate formulas for 
these distributions. The approximate formula for the distribution of f = 
Cn/Fn is 

(1.3) — 1)2*” tt”"^ / (cos a — f)*”“^ sin ^na sin a da, 

Jo 

A similar approximation will be used in this paper to find the moments of 
Cn/Vn . It will be shown how good the approximation is and how by using this 
approximation we may obtain a tabled function (Pearson Type I) which fits the 
distribution of 1 — {Cn/Vnf up to in moments. 

The quantity 1 — {Cn/Vnfi we shall find, is equivalent to a likelihood ratio 
function for testing the hypothesis that the serial correlation is zero. 

Anderson [8] obtained the distribution of iCnlVn = iMn . He proved that the 
distribution of iRn is the same as that of iRn when L and n are prime to each 
other. He has computed the 1 per cent and 5 per cent significance values 
(L = 1) up to n = 75. For values of n > 75 he indicates that a normal distribu¬ 
tion which is an asymptotic approximation may be used. He has also computed 
some significance values for the cases of N/L = 2, 3, 4. 

- In this paper we shall develop the moments of iJRn . 

The use of the ratio rj in the study of serial effects in ballistics at Aberdeen 
Proving Ground is given in references [1] and [2]. The use of the ratios Cn-i/Fn 
and Cn/Vn in the study of economic time series is discussed by Koopmans [9]. 

2 . Likelihood criteria. Given a sample of n observations xi, X 2 , • • • , Xn 
we shall assume that they are distributed according to the law: 

( 1 -A s 

dxi*- dxn, ^ (1 < i < n). 

It will be convenient to use the phraseology that ‘‘the variate at the time a has 
as its mean value a linear function of the variate at the time a — I,” We shall 
take Xi == Xn+». Due to the sjrmmetry we may use a + lia place of a — Z. 
This will be done to obtain agreement with previous work. We wish to test 
the hypothesis Hi that each variate is independent of the other variates, that i6, 
that b = 0. The Neyman-Pearson specification of Hi may be written as follows, 
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m 


where Q is the space of admissible values of a and b, and <a the subspace de¬ 
fining Hi: 


( 2 . 2 ) 


f0:<r®>0 —oo<a, b < <x> 
[a>;<r*>0 —oo<a<oc, 6 = 0. 


The likelihood criterion Xi suitable to this hypothesis is the ratio of the maximum 
(w (max.)) of (2.1) with the restriction that 6 = 0 to the maximum (0 (max.)) 
of (2.1) without this condition. Now, 


(2.3) 


dPn(<») max) 
dPn(0 max) * 


We see that the likelihood function is 


(2.4) 


1 ^ 

L - —n log (s/^irff) — ^ ^ (Xa — a — bxa+if 

^<r a»l 


and to maximize L over the space Q we compute the following derivatives 


da 



a - bxa+t), 


(2.5) 


~ 2(x« - a - bXa^l)Xa^l, 
do 

dL n , 1 r \ 


The solutions of the equations obtained by setting the above derivatives equal 
to zero are: 

s= 1 S(x« — d — 

71 

(2.6) = - 2a:«(l - h) 

n 

nZxl — (2a;a)* 

If we now maximize L over the space w we obtain 

= - 2(x„ - df 
n 

(2.7) 

d = - 2 .t« 
n 

so that we will have 

(2.8) (iPn(« max) = [2ir(l - 6*)2(x. - x)*]'^ V‘", 

(2.9) dPniu max) = [2ir2(.'i;„ - *)*]“*"«"*", 

(2.10) Xi - (1 - £*)*", 
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where b is Refined as in (2.6) above. If we set a = 0 in (2.1) we may follow ^ 
similar procedure and find the criterion oXi == (1 — So)*" for testing the hypothesis 
that 6 = 0 if it is known that the population mean equal zero. Here 

(2.11) bo 

ZXa 

We notice that 6 is the criterion chosen by R. L. Anderson as a measure of 
serial correlation. He has obtained the distribution of 6 and has computed a 
number of significance values from this distribution. The distribution is a 
function of n and I, and Anderson points out that the larger the values of n the 
smaller the extent to which the significance values depend on L 

In the next section we shall find a distribution which approximates very 
closely the exact distribution of 6 and which is independent of the lag L 


3. Moments of the likelihood criteria. We shall determine the moments of 
ho and oXi when the hypothesis oHi is true and the moments of 6 and Xi when the 
hypothesis Hi is true. I^t us first consider ho = H^XaXa+i/'^Xa , the criterion we 
obtained for testing the hypothesis oHi . The moment generating function for 
the joint distribution of Co = 'ZXo^a-^i/or^ and Vo = is 


h) = A’[exp {Cot\ + ^ 0 ^ 2 )] 


(3.1) 


■ (vs.) /v 7 “ 2? ^•) .5 ■ 


By reference to this last expression it can be seen that 

(3.2) ^ _ I > 

and further 
<3.3) 


i?(&o) = r dto, 

B®) -/•■•/ tldh,. 

J-co J di\ J«i-o 


in which we set < 2 ,. 

j-i 

Now if we write (3.1) as follows: 


(3.4) 




we see tliat , fe) = 1 \~^ and ; that is, this determinant is 

a circulant. Let us write = ah where h equal j — i+1 or j — i+l+n 
taking that subscript which gives 1 < < n, so that we have 


(f , Ut) = 


ai 

02 

Os • • • 

On 


Ol 

02 • • * 

On-1 

^ 02 

0 * 

O4 • • • 

Ol 


(3.5) 
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which expanded by the method of eirculants becomes 

n 12 

ib-l t-1 

where the ci)k are the nth roots of unity. 

First we shall consider U (lag 1). Here ai == 1 — 2^2 ,02 = On — —mid 0$ 
to ttn^i equal zero. 

n 

U) = n (1 2fe ~ 

... . A-1 


= n(i- 


2^2 — 2t\ cos 


For lag ai == 1 — 2 / 2 , = an-z+i = — <1 and the remaining a’s = 0. 

n 

Qip\ ^{t\ , U) = JJ (1 — 2fe — <l(c0fc + Wib*)) 
fc-1 


= 11 1 -2f2-2<iCos"'‘-"' . 

ib-i \ n / 

We shall develop an approximation to theses functions (3.6) and (3.7) as follows; 

1 ** 

, —5 S (il+B OOS ttjfc) 

(3.8) QipiiU > ^- 2 ) = H (^4 + B cos ak) * = c , 

fc-i 

where in tliis case A = 1 — 2^2, B = ’“2<i and ak = 2vlk/n, Let us now alter 
the form of this exponent and replace the sum of a finite number of terms in¬ 
volving ak by an integral of a continuous variable a. 

(3.9) dipiiti , U) = exp (-— — ^ log (A + B cos ak ^. 

\ 4x4 n ifc-i / 

I^et us write 2rlln = Aa* , then we shall have 


o<^i(<i, /^) = exp 


(sS 


log (A + i? cos ajt)Aaifc 


If we take ^ < A we see that A + 15 cos a* is never negative; therefore as 
we let n increase the summation will approach the value of the integral: 

n2irl 

/ log {A + B cos a) da. Let us then replace this summation by its limiting 
Jo 

integral. The resulting function will then be an approximation for n large 
enough. We shall obtain then 


0^1 (^ 1 , h) exp log (A + B cos 


a) da) 


B < A, 


from which by the xise of the integral* we obtain 

(3.12) ov^(<i, <i) ~ exp (—§n 1<^ i[A + V-A* - B*)). 

' This integral may be verified by differentiation with respect to the parameter a . It 
may be found as formula 523 in Pierce’s *‘Short Table of Integrals.” 



124 


WILFRID J. DIXON 


So that 

(3.13) , fe) ^ IhiA- H“ \/A^ — B^)] 

We now have , fe) represented approximately by a power of a single quantity. 
The question of how good this approximation is will be discussed in a later 
paragraph. This is similar to the approximation used by T. Koopmans in the 
distribution of &o. He makes the substitution “to obtain what intuitively seems 

T 

to be in some sense the closest approximation/’ He approximates II (k — 

1-1 

2irt 

where h — cos — by the process followed in (3.8)-(3.13) in order to find the 

distribution given in (1.3). 

To obtain the corresponding function for 

‘ ^ Zxi ~ (Sx«)Vn ’ 

we follow the same procedure as above for bo . Here 

C = [2a;.a:.+, - 
V = [Sx^ - (Sx,)Vn]/<T*, 


and in (3.5) ai = 1 — + 2(ti + t 2 )/n; oa = Un = —ti + 2{ii + t^)/n; and all 

the other a’s = 2(ti + h)/n so that the expansion of this circulant becomes 


fe) 


(3.15) 


n |[1 — 2<2 + 2(^1 + fa)/w] + [—+ 2(^1 + t^/n[{tj)k + 

fc-i i 

+ [2{t\ + fe)/n] 

t-8 


and since 

1-1 /~(w* + + 1) 

t-8 t n — 3 

we get 

(3.16) n {1 - - tiiw, + «*')), 

and for lag I we get 

(3.17) n {1 - % - tiU + wr')}. 

ib-1 


k 9^ fly 

k ^ n, 


These two results are the same as those obtained previously except that the 
final term, 1 — 2^i — 2<2 = A + JS, of the products is missing. We will then 
obtain as an approximation to these finite products ipT^ =* o^rV(^ + B) or 

(3.18) viiti, k) = [KA + VI^~'W)r^WA+^ 

where i4 = 1 — 2fe and B = —2ii. 
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A method of fincUng the moments of ho and & was outlined in (3.1), (3.2) aiid 
(3.3) above. If we perfonn these (^raticHis on vpiih, tt) as defined in (3.18) 
we find 


, -ln~l dW 




du — 2 

where m |o = 1 — 2<s, ^ = 0, = ;;- ^ , etc., and the zero subscript 

Oil 

indicates that h has been set equal to zero after differentiation. If these values 
are substituted and the required number of intentions with respect to <2 are 
performed, we find the moments of the criterion ho when offi is true. 


il/i = 0 


n + 2 


3/o = 0 il/4 = 


(n + 2)(n 4- 4) 


( 3 . 20 ) 


3fo = 0 M. = 


= 0 




(n + 2)(n + 4)(n + 6) 


etc., or 


1-3-5 ••• (21: - 1) 


(n + 2)(n + 4) • • • (n + 2A:) ' 

Mik may be verified by the use of an expansion of the generating function (3.13) 
by a method of Laplace [10]. He gives the expansion of «“* where u is given by 
the equation u~ — 2u + e* = 0 as follows: 

f _ 1 ie^ i{i + 3)e‘ t(t + 4)(f + 5)e® 

“ 2* 2‘+- 1.2~2‘+< 1.2.3.2‘+* 


I , t(t + 1: 4" 1) • • • (» 4- 21: — l)c** . 

(1: - i)! . 2*«* -'-•••• 

We see that m = 1 4- y/l — and if we set e = ii/(J — f») and i = in. We 
obtain vpi = u~*” as a series in the even powers of . From this we can see that 
the odd moments are zero and from the form of the coefficients we can verify 
Mok. 

These moments are not contained in the works of the other authors writing 
on this subject. Although these moments are obtained from an approximate 
generating function they are, as will be shown later, the exact, not approximate, 
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moments for k < n, for lag 1 and are the exact moments for k < n/a for any lag 
I where a is the largest common factor of n and the lag Z. 

To obtain the moments of b we follow a similar procedure with 
(Pi = — 2Zi ~ 2k)\ Differentiating (pi the required number of times with 

respect to h and integrating an eciual number of times with respect to fe gives the 
following moments: 


(3.21) 


ilf, = i- Mi 

n — 1 

" (n3) 


1_ 
n + 1 

_ 3 _ 

(n+iKn + 3) 


-1.3-5 (2A; - 1) 

' (/i — l)(/i + 3)(W' + 5) • • * (n + 2Z? — 1) 


_ _ 1>3>5 . .. (2 A; - 1)_ 

(n + l)(?i + 3) • • * (n + 2fc — 1) 


Examination of the moments of bo will show that bo = x is distributed accord¬ 
ing to the law 

(3.22) Ki{l - - Ki{l + 


up to n moments. This distribution is symmetric and we may wish a normal 
approximation to this curve. The mean is zero and the variance is l/(n + 2). 
The X criterion oXf"" = (I ~ hi) = y is distributed according to the law 


(3.23) 






Yl j 1 

up to in moments. Here the mean is — 7 - 7 ; and the variance is 


2 (n + 1 )_ 

'n +2 ----'(71 + 2)\n + 4) ’ 

If we inspect the moments of b we see that the distribution of X?^'* = (1 — fo“) == z 
follows the law 


(3.24) 


K,{1 ~ 2 )-’^ 


r-U(n-2) 


up to in moments, which is the same as the distribution immediately preceding 
except that n is replaced by n — 1 . The distributions (3.22), (3.23), and (3.24) 
are the same for lag I except that the fit is up to n/a, n/2a, and n/2a moments 
respectively where a is the largest common factor of I and n. These restrictions 
are necessary since the moments as given in (3.20) and (3.21) are obtained from 
the approximate generating functions (3.13) and (3.18). The exact generating 
function is given in (4.8) for lag 1 and it is found that the 7 ^th or higher deriva¬ 
tives bring contributions from the part of the generating function which was 
neglected in approximating the generating fimction. The additional restriction 
for lag Z 5 ^ 1 will be seen in the last two paragraphs of section 4. The extra 
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factor \ in the second and third case above is due to the fact that only the even 
moments of (3.20) and (3.21) are used. 

We have in (3.23) and (3.24), then, very close approximations to the dis¬ 
tributions for the two X criteria for testing serial effects. 

The following table is a comparison of the exact and approximate 5 per cent 
and 1 per cent points for the distribution of S. The exact values are taken from 
the table given by R. L. Anderson. The normal approximation as given by 
Anderson in his table does not show such close agreement since he used an asymp¬ 
totic second moment. He indicated that the exact values would have to be 
used for values of n < 75 in place of the values from the normal approximation 
which he obtained. Here we see that the normal approximation may be used 
for n somewhat less than 75. The Pearson Type I approximation was obtained 
by using the first two moments of h. The curve obtained is: 


(3.25) 


(1 + x) ^\l - 


, . , (n - l)(n - 2) ■ n(n - 1) 

- 2 (» 

U'he exact values marked with an asterisk in the table differ slightly from 
those previously published. They ai*c more precise values from the exact dis¬ 
tribution which R. li. Andei’son has made available to the author. 


Positive tail 


N 

Exact 

S% 

Type I 

Normal 

Exact 

i% 
Type I 

Normal 

5 

.253 

.317 

.281 

.297 

.527 

.501 

10 

.300 

.302 

.350 

.525 

.533 

.541 

15 

.328 

.329 

.323 

.475 

.477 

.480 

20 

.200 

.299 

.290 

.432 

.433 

.440 

25 

.270 

.270 

.274 

.398 

.398 

.404 

:io 

.257 

.257 

.255 

.370 

.371 

.375 

45. 

.218 

.218 

.217 

.313* 

.313 

.310 

75 

.174* 

.174 

.174 

• .250 

.250 

.251 

N 

Exact 

B% 

Type I 

Negative tail 

Normal 

Exact 

1% 

Type I 

Normal 

5 

.753 

.742 

.781 

.798 

.858 

1.000 

10 

.504 

.502 

.572 

.705 

.702 

.703 

15 

.402 

.401 

.400 

.597 

.590 

.029 

20 

.399 

.399 

.401 

.524 

.524 

.545 

25 

.350 

.350 

.357 

.473 

.473 

.487 

30 

.324* 

.324 

.324 

.433 

.433 

.444 

45 

.202 

.202 

.202 

.350 

.350 

.362 

75 

.201* 

.201 

.201 

.270 

.270 

.278 
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4. Alternative expansions of the generating functions. In this section we 
shall determine the exact generating functions which were approximated in (3.12) 
and (3.16) and obtain these same approximations in another manner. This 
development will enable us to see how good the approximation is in the sense 
that it gives a certain number of exact moments. The determinant in (3.4) for 
lag I and mean zero can be written 


(4.1) 


An 


ah b 

hah 
bah 0 


h 


0 


bah 
b a 


In 


where a = 1 — 2<2, b = —and all the other elements are zero. The h in the 
upper right corner and lower left comer indicate the value h in the ain and Oni 
positions. Let us define the following determinants: 


(4.2) 


a b 


b a b 

b a b 


b a b 

b a b 0 


b a b 0 


Cn = 


0 


0 

b a b 


1 h a 

b b a 

n 

\b b 


Pn 


a b 
bah 
b a b 0 


We see that 


jo b a b 

I ban 

An ^ Bn + i-iy-^bCn-l, 


(4.3) 


Bn = Dn+ (-1)"-V, 


C„ = 6" + 


and An can be expressed in terms of Dn by substituting for Bn and Cn in the equa¬ 
tion for An . 

(4.4) 4„ = i)„-6*Z)n_2 + 2(-l)"-V. 

We can obtain an e:q>ression for D„ if we expand this determinant by the first 
row. This gives 

(4.5) 


Dn = oDn-X — h*Dn-t . 
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Since this is a second order difference eqiiation, the solution may be written 
2>f» == ifciM* + where w, v are roots of the equation — ox + 0. Now, 

Di ^ u + V = kiu + kiu, 

(4.6) 

A == + t;* + tit; = kiu^ + hv^ 

so that we can determine fci and k 2 . We now see that 


which upon substitution in the equation for An gives 

A„ = yr + v’' + 2(-l)"-V, 

(4.8) 

= u" + t;" - 2<r, 

where u, v = ^(1 — 2fe) zfc \/(l — 2fe)^ — Now , fe) == An* and it is 
easily seen from the form of An directly above that derivatives up to the nth 
order with respect to h in which h is then set equal to zero will be given by de¬ 
rivatives of An = n” and this is the approximation (3.13) found by other methods. 
The determinant in (3.4) for lag 1 and mean not equal to zero can be written 


a h 
bah 
hah 


a = 1 — 2^2 H” 2(ti -f- k)/Uy 
h = — + 2(^1 + t2)/ny 


2{ti + tt)/n. 


bah 

h ha 

Lei us define the following determinants: 


hah 

hah 

hah 


a h 
bah 
bah 


(4.10) 


hah 

hah 

hah 


a h 
hah 
bah 


hah 
b a 


hah 
b a 
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(4.12) 


If we replace the h in the upper right comer of Anhy c + (h — c) we obtain 
(4.11) An = Cn + (-!)""'(& - c)Bn^i . 

If we replace the h in the lower left corner of Bn and Cn by c + (b — c) we obtain 
Bn = Dn+ c)En-l , 

Cn-^n+(-l)""*(6~c)Z)n-l. 

We now have An in termfi of Dn and En . We must now evaluate Dn and En . 

Ill--* 1 111--- 1 

0 h a h —c r 8 r 

0 hah — c r s r 0 

I 

( 4 . 13 ) Dn = 


0 


ha r s 

^ In-fl 1 n+1 

where r = b — c and s = a — c and the second determinant above is obtained 
from the first by subtracting c times the first row from all other rows. Writing 
this last determinant as the sum of two determinants by separating the first 
column we get 

(4.14) Dn = r" - cFn+i. 

Combining the above difference equations we obtain 

(4.15) .4„ = + 2(-l)”rcF„ + 2(-l)’‘-V" 

and see that we must obtain En and Fn . 

Expansion of E«+i by the second column gives 

(4.16) Fn+i = -Cn + rFn 
and expanding Gn by the last row we get 


(4.17) 


(,\ = rGn-t + i-iy-'Hn-i 


n+l 


(4.18) 


0 1 1 • • 1 


1 s r 

1 r s r 0 


1 r s r 0 

1 r s r 


1 r 8 r 


II 


0 


0 

r s 


r 8 

1 T 

»-i 1 

1 r 


Hn = 


s r 
r 8 r 
r 8 r 


T 8 T 
T 8 
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Hn is the same type as (4.1), therefore ^-where u and v are the 

1/ V 

roots of the equation x — ex + ^ 0, so that (4.17) becomes 


Gn — = ( — 1 ) 


•-1 u — V 


u — V 

and the solution of this equation gives 

(4.20) e. - .7 

2r + s (w — v)(2r + s) 

Introducing this expression into (4.16) we find 


(4.21) 


Fn = (-1) 


«-l u — V 


{u — y)(2r + 8) 2r + « 


1 1 1 • • • 1 


1 1 1 • • • 1 

0 a b 


— c 8 r 

0 h a b c 


— c r 8 r 0 

c 


. ... 

0 

b a b 


r 8 r 

0 b a 

fi+i 

— c r 8 


(4.22) En 


where the second determinant is obtained from the first by subtracting c times 
the first row from all other rows. If we separate this last determinant on the 
first column we get 


10 1 1 • 
; 1 s r 
I 1 r s r 


Efi ““ Eln 
•11 ! 


0 1 1 • • • 

1 s r 

1 r s r 0 


r s r 
r 8 
r r 


Expanding In by the last row, we get 

(4.25) In = ("“I)** + s/n-l . 

Expanding Jn by the last column, we get 

(4.26) Jn = (- 1)""‘(?„-1 + rIn -1 . 

If we combine these last two equations we find 

(4.27) In - aln-l + r*In-2 = (-1)-*(G',_, + td^). 
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If we now solve this difference equation for /», substitute this solution in the 
equation for , and in turn substitute this result and the expression we ob¬ 
tained for F» in. (4.15), we get 


(4.28) 


u" + »" -h 2(-l)"-‘r" 
2 r + « 

«“ + !>"- 2«r 

1 - 2<i - 2<* ■ 


The final form results since 2r + « = 1 — 2ti — 2<t, r = — . w and v have the 
same values as before. If we compare this result with that obtained in (4.8) for 
mean equal to zero we see that this is the same except that here we have the 
added factor, 1 — 2ti — 2<j, in the denominator. We have a similar result then 
for the approximation for derivatives of , fa) = for fa = 0. Here this 
approximation is An = uV(l ~ 3fa — 2fa), the same result as that obtained in 
(3.18). This approximation will yield moments which are exact for n > ofc for 
any lag I where a is the largest common factor of n and 1. The reason for this 
restriction is easily seen if we consider the expansion obtained in (3.7), for 


(4.29) 


, fa) 


dfa 


— , fa) 


i-i 


—2 cos 


2itlk 

n 


1 — 2fa — 2fa cos 


2^- 

n 


with fa = 0, 


(4.30) 


Further 


dti 


'^1 J^l"® 

^(0, fa) 


—2 cos 


2Wfc 

n 


1 o, 

1 — 2fa k-i 11 


1 - 2fa 
2irlk 


(4.31) 




g(fa, fa)~| 


_ <p(0) fa) 

(1 2 fe)‘ 


r(£c»?fY+2tc„s=?fl 

L\*-i ^ / fc-i w J 


and the mth partial derivative will contain the sum of the mth powers of the 
cosines. These are the sums of the powers of the real parts of the roots of unity 

2vlk 2irk 

and it is easily seen that X cos’"-= S cos’" — only for m < ak where a is the 

n n 

largest common factor of n and L 

To change the moment generating function of 6o to that of b we must drop the 

n—I ^irlk 

last term of the product. In the above expressions we then have 23 cos’"- 

n 

and the same conclusion will hold. 
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6. Applifiation to successive differences. If we change slightly the function 
V = fin-i/Vn investigated by von Neumann and Williams we find the moments 
and distribution greatly simplified. Let us define 


(5.1) 5* ■= S (®« “ 

<-1 

where Xn+i = Xi and conrader the ratio ovi of to . Now, 

(5.2) in 2Xx* — 22XjXi4.i 
therefore 

(5.3) • 01 J 1 « —, = 2(1 - fco) 

and we may find the moments and distribution of oin directly from those of 
ho . We find the moments to be: 


(5.4) 


wii = 2 


nh 


2*(n + 3) 
n + 2 


mt 


nik 


2*(ra + 5) _ 2*(n + 5)(n + 7) 

n "j- 2 (n 2)(n 4) 

_ 2*(n + 2fc - 1)1 _ 

(n + *)! (n + 2)(n + 4) • • • (n + 21: — 2) ’ 


{k < n) 


and the ratio ovi is distributed according to the law 
(5.5) **(4 ~ ovx)**" 


up to n moments. 

If we replace ip the above ratios by Xj — x we find the moments of the ratio 
Iji = 5*n/2(x< - if to be: 


m\ 



J*n(n + 3) 

(n — IKn + 1) 


(5.6) OTs 


2’n(n + 4)(n + 6) 

(n — l)(n + l)(n + 3) 


2‘n(n + 5)(n + 6)(n + 7) 

(n - l)(n + l)(n + 3)(n + 5) 


2*n(n + 2fc — 1)1 

~ (n + fc)! (n — l)(n + l)(n + 3) • • • (n + 21: — 3) 


and (i?i — 2)* = 2 has the distribution 
(5.7) C,2-*(4 - 2)*‘"-» 


up to \n moments. 

The ratio h\/^x\ compares the variation of the first differences to that of the 
original variates. We might wish to compare the variation of the second 
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differences to that of the first differences. For this purpose let us form the ratio 


(5.8) 


^2 




S - Xi+if 


Xn+l = Xl 
Xn^i = Xi 


to test the hypothesis that the variation of the second differences compared 
to the variation of the first diffei-ences is such as would occur by chance. Let 
Xl, X 2 Xn he normally distributed ^ith mean a and variance <r®. The 

ratio 1/2 is independent of the mean value of the variates, therefore we may 
consider a distribution with mean eciual zero. We shall develop the mean and 
variance of i ?2 when the hypothesis to be tested is true. The moment generating 
function for the joint distribution of A == S(xt — 2xi+i + I2g and Z>i = 
^{Xi — Xi+if/2<r^ is 

< 2 ) = -^[exp (7^2 4* 7 ) 1 ^ 2 )] 

(5 0) /I \” r®® r / 1 \ ” 

■ (vs.) L' l H*'- 

We may find the moments then by a process similar to that outlined in (3.2) 
and (3.3), The next few steps are identical with (3.4) and (3.5). For the pres¬ 
ent problem, however, ai = 1 — 6 h — 2^2 , 02 = 4<i + ^2 , 03 = so that 

n 

<P ~{U, U) — IT [1 — ()<i — 2<j + (4<i + < 2 )(to* + ctffc') — <i(a>| + (Ot*)], 


( 5 . 10 ) 


-n[“ 


6<i 


2h 4" (8<i 4* 2^) cos — — 2t\ cos 
n 


-vy 


. , 2Trk , ^vk 

4 - 0 cos-h c cos — 

n 


Trfcl 
n J‘ 


If we follow the same procedure indicated in (3.8) to (3.13) we obtain succes¬ 
sively 

n 

(5.11) to) = n (<^ + ^^ ak 4“ c cos 2ak)~^ 

k^l 


(5.12) 

0 

= e 

(5.13) 

= exp ^ 

and replace the summation 

as n —» 

00 . 

(5.14) 

<p(<i, <s) ~ exp 

(5.15) 

exp (■ 


/~n ^ Y 
\ 4t n h 


log {a + h cos ak + c cos 2ak) 


) 


exp log - 


log (a + b cos a + c cos 2a) da^ 

+ Vi'1 + Vi~-V 


]) 
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where 


ic as a — o; iy, 6 


^ b :fc \/y + 8c« - 8oc 
2(a — c) 


We then have approximately 

(5.16) , <0 - hkh + vi^)(i + \/rr^)r*". 

(5.14) follows from (6.13) if we replace the summation by an integral, and (6.16) 
is obtained in the following manner: replace cos 2a by 2 cos* a — 1 and factor 
the resulting quadratic and integrate the factors separately. 

|*2ir ^2ir 

/ log (a + 6 cos a + c cos 2a) da = / log (a ~ c + 6 cos a + 2c cos* a) da 
Jq Jo 

m 2 t a2w 02r 

(5.17) = J log K da + J log (1 + 5 cos a) da + jj^ log (1 + COS a) da 

= 2ir log fc + 27r log ^(1 + \/l — 5®) + 27r log ^(1 + \/l — rf^). 

If we now expand (5.16) by multiplying the factors within the brackets and 
substitute for k, ij and 5 we find 


<p(ii ,k)~[A + B + C + or*” = p-*", 


where 


a = 1 — 6<i ~ 2^2, 
b = 8li + 2^2 f 




A « J (1 - 4 <i - 2 fe), 

= J[1 — 12^1 — 4fe + Sills + 2^2 
— 2(4^1 + <2)V^ ^2 + 4fi]*, 
C = 4[1 — 12/i — 4fe + Sfife + 2^2 
+ 2(4^1 + i^y/ A + j 

D = J(1 - 16<i - 4fe)*. 


From (5.18) P = A + B + C + D and at = 0 

p = id + vr ^)*, 

(5.20) ^ = -2[1 + 2(1 - 4fe)"‘], 


(1 - Vl - 4fe)“ 


'dt\ (1 - 4&)* 




If - - »(f)’+ 


(5.21) 
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K we substitute in this formula and integrate the first with respect to it we shall 
obtain the first moment of the ratio ijs. If we integrate the second twice with 
respect to it, we shall obtain the second moment of the ratio . We find these 
moments to be 


(5.22) 


,, 3n + 2 ,, 9n* + 23w + 12 

= rT+l = (n + l)(n +-2) 

2 _ 2n* + 7n + 4 
(n + l)*(n + 2) • 


6. Likelihood criteria for multiple serial correlation. Given a sample of 
n observations, Xt, X 2 , ■ ■ ■ , x. , we shall assume that they are distributed 
according to the law 


( 6 . 1 ) 





-a- S 


biXa^l 


.)■ 


dxi • • • dXn 


(Xi = Xn+i) 


that is, that each variate, say at the time t, has as its mean value a linear function 
of the variates at time i — hyt—Uy etc. Let us investigate the likelihood criteria 
for testing the hypothesis. Hr , that each variate is independent of the others; 
i.e. that the 6,- = 0 (i = 1, • * • , r). For the hypothesis Hr we define the space 
and the space co, as follows: 


( 6 . 2 ) 


il: a > 0 - 00 < a, b, < oo 

w: <r* > 0 — 00 < a < 00, 5^ = 0, 


we find the likelihood ratio criterion 


(6.3) 
in which 


(6.4) 


^ 2 /n _ I I 

Goo I Clpq I 

Oflo = 23 ~ 

a 

Ooi = S (a!« — *)(a^<>+!< - £) 

a 

Oii = 23 (a^«+i.- - •^)(a;«+J,- - x) 


I, j = 0, 1, • • • , r 

p, g = 1, • • •, »• 


and it is noted that an = ooo and if the h are equispaced at,t+c = ooc. Xr is a 
statistic which measures how completely each variate at time t can be expressed 
as a linear function of variates spaced at time t — ky t — etc. 

Next we shall develop a statistic for testing the hypothesis, Hr,m , that of the 
set of the values (i = 1, • • • , r) in (6.1) the subset bm+l , htn+2 , * * * , br *“ 0. 
Here we have the same likelihood function but for Hr,m , we define the spaces 
Q and a> as follows: 

ffl: <r^ > 0 — oo < a, b< < 00 : 

^ ^ \(a: <r® > 0 — oo < a, bu < oo, b^, = 0, 

w = (1, • • • , m), I/? = (m + 1, • • • , r), 



and obtain the criteri(Hi 
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( 6 . 6 ) 


\*/» _ I 11 <»•* 1 
~ |ap,llo-l 


i,j - 0,1, •••, r, 
p, 5 - - 1, • • •, r, 

Sf t Oj 1| * * * > 
tt, » ■« 1, • • •, m, 
{m < r). 


The form and the derivation of these X criteria parallels very closely that of the 
likelihood ratio criteria obtained in multivariate analjrsis for testing s^ificance 
of regression coefficients. 

Case I. If we set r = 1 in X, we obtain 


(6.7) 


x!'" = 


Ooo Oot 
Ooi Ooo 


floo 


<ko Ooo 


for which the distribution is given in (3.24). 
Case II. If we set r = 2, we have 


On 


( 6 . 8 ) 


for which if we take — 1, ?2 = 2 we get 


(6.9) 


The expanded form of this numerator is oJo + 2a?iao2 



Ooo 

Ool 

O02 

Ool 

Ou 

012 


Ooo 

Oil 

2 we get 

012 

Ooo 

Ool 

O02 

Uoi 

Ooo 

Ool 

ao2 

Ool 

Ooo 

Ooo 

1 Ooo 

Ool 

1 Ool 

Ooo 


002 

Oi2 


Let us consider 

( 6 . 10 ) <p( 8 o, e,, 82 ) = (^-) • 7 


2 rt 2 

O00O02 — 2 ooiOoo • 


——r{ Sxjd—SxaX«+1—®srXatXa+2} 


n dXa 


We shall find the mean and variance of 0 X 2 ^** (mean == 0) when the hypothesis 
oHr (r = 2) is true. We can find the first moment of 0 X 2 ^” then by performing 
the following operations: (a) compute 


( 6 . 11 ) 


, 2 ^ 

do! del 662 


dtp fp 
^0 ^ 


2^^ 
del sOo' 


(this will give the first moment of the numerator) and set 62 = 0, (6) integrate 
from — 00 to 00 + with respect to 00 + 01 “ r> from — 00 to 0o — 0i, with 
respect to 0o — 0i = f, and set 0i = 0 (at this point we will have the first moment 
of the third order determinant divided by the second order determinant), (c) 
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integrate with respect to 6o from — oo to 0. The reason for step (b) is easily 
seen since the second order determinant a^o — chi may be written (ooo — Ooi) 
(oco + Ooi)* 

Further moments may be computed in a similar manner. o<^2(^o, , ^2) 

may be written as a determinant in the manner indicated in (3.4) and (3.5), 

Here, ai = 1 — ^01 02 = an = and as = Un-i = —^^2 and ai to an~2 = 0, 

then 

n n n / 2^^* 4irk\ 

(6.12) o<p2^(^oy ^1, ^2) = IT a,-Wife ^ = IX { ai + 2a2 cos — + 2a3 cos — j. 

fc-i t-i ife-i \ n n / 

We shall approximate o^(^o, Bi , 62) by the method contained between (5.11) 
and (5.18). We set 

(6.13) q<P 2{6 q y 61, 62) = n ( a + 5 cos + c cos ) 

fc-i \ n n / 

and obtained 

(6.14) o<,^2(^o , 61 y 62 ) [A + B C + D] = P 

where 

A = \{a — o') 

B = -|(4a' - 4c' ~ 25' - 2hE^f = 

(6.15) C = i(4a' - 4c' - 25' + 2hE^)^ - 

D = i((o + cf - by = J** 

P = 5' + 8c(c — a). 


It is easily seen that we may operate (differentiate and integrate) with a, 5, 
c, in place of ^ 0 , , ^2 respectively. Therefore, we compute 


(6.16) 


dc dc 




and since P=^A + B + C + E> we compute 
dA 1 d^A „ 


(6.17) 
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In order to evaluate the expressions in (6.17) we must find 
dc do L 5 


(6,8) £2.-8c + 6E-^( 


- 2(<i + c): 


o iTT-if l/aiSV . 
d* 7 _ O I ET“i r 1 /, ip 


' ill® 

« 2 . 

dc» 


If we now set c = 0, we obtain 

P = i(a + (o* - 6‘)*) 
dP _ a - (a* - 6*)* 

“ ■ 2(a* - W 

a'P _ 2a‘ - 4a"6* + b* + (-2a* + 2a5")(o" - 6*)* 
ac* 25*(a* - 6*)» 

We may now substitute these values in (6.16) and then substitute the resulting 
expressions in (6.11). The remaining values that are required for (6.11) are 
easily computed since they may be obtained from <p with c = 0, i.e. 

(6.20) ,<n(eo , fli, 0) = [i(o + (o* - 

The result of these substitutions gives 

(6 21) -«"(n + 3)P->'- " 

8 (a* - ’ 

in which we set d = i(a — b) and e = |(a + h) and integrate with respect to 


and if we set 6 = 0 and integrate with respect to a, setting a = 1 , (^o = 0 ), we 
finally have 


(6.23) 


£(oX^'") 


2/nx __ 


n + 2 ‘ 


We shall now obtain the first moment of X 2 without the restriction that the 
mean equal zero. For this purpose let us consider 

(6.24) w(ao, a,, a,) = n . 

Here ai = 1 — + m, 02 = a« = + m, Oa = On-i = —+ m, 04 to 

an ~2 = m where m = (^0 + + Bi)/n, Expanding the determinant as in 

( 6 . 12 ) we find 

n n 

<pT^(fiof Oiy $i) = n S 


TT / j o 2irfc . 4irfc . 1 \ 

11 (ai+ 202 cos — + 2a«cos — + 2^1. 

*-l \ n R i^mA / 


(6.25) 
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so that 
(6.27) 
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n—2 n—2 

23 = m 23 

«-4 

—m(l + + co^ + « *), 

m(ri — 5), 


fc 7^ n, 
A; = n, 


<^2 *(^o, , ^ 2 ) 


n—l / 

nr 

A:-l \ 


2 irk 

ai + 202 COS-h 203 cos 

n 


47rA;\ 

IT/' 


We have obtained here a product which is the same as that in (6.12) except that 
the last factor is missing. The approximation corresponding to (6.14) will 
then be 

\A+B + C + /)]-*’• 


(6.28) 


, ^ 2 ) ^ 


(o + 6 + c)-* ^ 

since we may take the approximation for the product from 1 to n and divide 
by the last factor, (a + h + c). The procedure for finding the first moment for 
X 2 (mean = o) is exactly the same as that outlined for finding the first moment 
of 0 X 2 (mean = 0). We obtain 


(6.29) 


— ^ 
n 

- 1 
+1 

• 

Case III. 

If we set r = 2, m = 1 

in Xr, 

tn we have. 


Ooo 

Ooi 

Oo2 



Ool 

Ooo 

Ool 

Ooo 

(6.30) 

^2/n 1 O02 

A2 ,i = - 

Ool 

Ooo i 
2 

_ ^ 



CLOO 

Ool 




Ool 

Oco 



To find the moments of X 2.1 let us consider the following distribution, 

(6.31) (vs;)' 

in which 0 represents the population value of the serial correlation coefficient. 
The moment generating function for the joint distribution of aoo/2flr^, aoi/2<r* 
and 062 / 20 -* will be 


0^6,)= /;• •/ exp {i:[(xv 


x) 


/3(a:„+i — a;)]“ 
) II dXa 


— Ooo ^0 Ool ^1 — 

4" aoi(—2/3 — Oi) + ao2(—^ 2 ) 1)11 


dXa. 


(6.32) 
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This function is very similar to (6JH). The approximation to , fli) 

here will be exactly the same as that obtained in (6^8) for , ft) except 

that here o=sl + /3* — ft,b« —2/8 — ft , c == —ft . For the case where the 
mean is zero, we find the approximation (6.14) in which a, 6, and c have the 
above values. 

We may obtain the first moment of X^.i by operating on the function 
^,i(ft, ft, ft) proceeding as follows; (a) compute (6.11) as before and set ft = 0, 
(6) integrate from — oo to ft + ft with respect to ft + ft == f from — oo to ft — ft 
with respect to ft — ft = € (at this point we will have the first moment of the 
third order determinant divided by the second order determinant), (c) differen¬ 
tiate with respect to ft, (d) repeat step (b), and set ft and ft » 0. 

The first two steps for obtaining the first moment of 0 X 2.1 (mean = 0) were 
performed for the first moment of 0 X 2 so that we may perform step (c) on (6.22). 
We obtain 


(6.33) 


+ (a* - fe*)*)]-*”-* 
4(a* - i»»)i 


and finally by step (d) we have 


(6.34) EMn = --2^ [i(a + (o* - b*)*)]-*""*, 

n "t* 1 

in which a == 1 + ^ and b = —2/3 since ft and ft have been set equal to zero. 
Substitution of these values in (6.34) shows that it is independent of j8, and we 
find 

(6.35) -• 

n -r 1 

Using (P 2 ,i{Bq , ft , ft), the generating function for X 2 ,i (mean = a), we find 

(6.36) Ei\l{r) = . 

n 

The procedure for obtaining the second moments of the above criteria consists 
essentially of performing twice the operations prescribed for obtaining tire first 
moments. The details given in connection with the first moment are sufficient 
to indicate the procedure. The details for the second moments are too com¬ 
plicated algebraically to list here. Table I indicates the second moments ob¬ 
tained as well as other moments obtained in the earlier parte of the paper. 


7. Serial correlation in several variables. Given a sample of n observations 
on each of k variables xta, i = 1, • ■ • , k, we shall assume they are distributed 
as follows: 


(7.1) 


dPn 


A*" 

(2t)*"* 




We wish to test the hypothesis Hm that there is no serial correlation, i.e., that 
i>< = 0, t = 1, • • • , fc. For this purpose let us define the space S2 and a as follows: 


II Aij II pos. def. —<*> < Oi < 


0 . 


(7.2) 


Q: 

la: 
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TABLE I 


X 

formula no. 

■E{x) 

E(a^) 


ho 

(2.11) 

0* 

1 

n + 2 

1 

71 + 2 

b 

(2.6) 

-1 

1 

n(n — 3) 

W — 1 

n + 1 

(n - l)»(n + 1) 

otfi 

(5.3) 

2 

4(n + 3) 
n + 2 

4 

n + 2 

m 

(5.6) 

2n 

4n(w + 3) 

4n(n — 3) 

n — 1 

(n - l)(n + 1) 

in - !)*(« + 1) 

V2 

(5.8) 

3/1 “h 2 

9n" + 23n + 12 

2n^ + 7» + 4 

n + 1 

(n + l)(n + 2) 

in + l)*(n + 2) 

oX] 

(2.10)ff. 

n + 1 
n + 2 

(n + l)(n + 3) 

(n + 2)(n + 4) 

2(n + 1) 

(n"+ 2)2(n + 4) 

Xi 

(2.10) 

n 

n + 1 

n(n + 2) 

(n + l)(n + 3) 

2n 

(/t + l)*(n + 3) 

0 X 2 

(6.9)ff. 

n 

71 + 2 

n 

n + 4 

4n 

in + 2Kn + 4) 

X 2 

(6.9) 

n — 1 
n + 1 

n — 1 1 

71 + 3 j 

4 (n - 1) 
in + lyin + 3) 

0 X 2,1 

(0.30)ff. 

n 

7l(7l + 2) 

2n 

n + i 

(w + l)(n + 3) 

in -f 1)=(« + 3) 

X2,1 

(6.30) 

n — 1 

n 

(n — l)(n + 1) 
n(n + 2) j 

2(n - 1) 
n®(» + 2) 


The mean of ho and b were also obtained by Anderson [8]. 


The development of the appropriate X criterion for this case parallels very closely 
the development of the X criteria in multiple regression analysis. Th(^ criterion 
obtained for testing the hypothesis Hkn is 


(7.3) 


2/n 


X^C 


Oij bij 
bjj ajj 


I o.ii P 



where 


Uij — (iTfa ^/)> 

a 

bij = i[2D (^*« *4“ 

The probability theory for the X criteria in (7.3) remains to be developed. 
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8. Summary. A problem in serial correlation which has I’eceived considerable 
attention is that of devising a statistic for indicating the presence of a relation 
between successive observations, i.e. a lack of independence of the order in which 
the observations were drawn. Von Neumann developed the distribution and 
moments of the ratio of the mean square successive difference to the variance. 
R. L. Anderson presented the distribution of a serial correlation coefficient which 
is the ratio R = SXaXa+i/SXa (Z > 1, subscripts reduced mod n). 

The present investigation was undertaken with the object of developmg the 
likelihood ratio functions for testing various hypotheses connected with serial 
correlation in one or more variables and determining the moments and in some 
cases the distributions of these likelihood ratios. 

The variates are considered to be ordered by their subscripts a = 1, • • • , n. 
The introduction of = xi , Xn +2 = etc. is made to obtain a symmetry 
which greatly simplifies the problem. 

The likelihood ratio criteria were developed for testing the hypotheses 

a) that Xa is independent of Xa^i 

b) that Xa is independent of Xa+u , i = 1, • , r; 

c) that Xa is independent of some subset of the .r« 4 .z» 

d) that in the case of several variables Xia, i = 1, • • • , fc, or = 1, • • • , n 

the ^ = 1, • • * , are independent of the Xi^a-^-h . These criteria are shnilar 

in form to those obtained in regression analysis. 

The likelihood ratio criterion for testing the hypothesis a) turns out to l^e 
X = (1 — where R is the function given above. The moments of R are 

obtained and from these the moments of These moments are found to 
agree witli those of a Pearson Type I curve to n/2 moments. A simple trans¬ 
formation gives us the moments of a ratio differing from that used by von 
Neumann by the addition of the term {xn — xif to the numerator. A simplifica¬ 
tion of the moments is attained by this change. In fact, if we denote this 
altered statistic by rj we find that {rj — 2)* is distributed according to a Pearson 
Type I curv^e to n/2 moments. 

The mean and variance were determined for the ratio of the sum of squares 
of the second 8U(*.cessive differences to the first successive differences. 

The mean and variance are obtained for the likelihood criteria for testing 
the hypothesis b) for r = 2, and for testing the hypothesis c) for r = 2 where 
Xa+h is the subset of Xa+ii ; {i = 1, 2). 

All the above moments were obtained under the assumption that the hypothe¬ 
sis to be tested was true. No results have been obtained thus far in cases b) 
and c) for a general r nor for hypothesis d). 

The moments for the several cases above were obtained by the use of moment 
generating functions which, for the criteria used, took the form of the product of 
n terms. In the case a) it was shown that the product could be approximately 
represented by the nth power of a single expression which was equivalent for the 
purpose of obtaining the first n moments. A method was developed for making 
analogous approximations to the generating functions for cases b) and c) since 
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it was not foimd possible to obtain the moments from the products in their 
original form. 

The author wishes to express his gratitude to Professor S, S. Wilks under 
whose helpful direction this paper was written. 
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ON A STATISTICAL PROBLEM ARISING IN THE CLASSIFICATION 
OF AN INDIVIDUAL INTO ONE OF TWO GROUPS' 

By Abraham Wald 
Columbia University 

1. Introduction. In social, economic and industrial problems we are often 
confronted with the task of classifying an individual into one of two groups on the 
basis of a number of test scores. For example, in the case of personnel selection 
the acceptance or rejection of an applicant is frequently based on a number of 
test scores obtained by the applicant. A similar situation arises in connection 
with college entrance examinations. Again, on the basis of a number of test 
scores, the admission or rejection of a student has to be decided. In all such 
problems it is assumed that there are two populations, say wi and vz , one repre¬ 
senting the population of individuals fit, and the other the population of individ¬ 
uals unfit for the purpose under consideration. The problem is that of classifying 
an individual into one of the populations and rz on the basis of his test scores. 
Often, some statistical data from past experience are available which can be 
utilized in making the classification. Suppose that from past experience we 
have the test scores of Ni individuals who are known to belong to population in, 
and also the test scores of Nz individuals who are known to belong to population 
vz . These data will be utilized in classifying a new individual on the basis of 
his test scores. 

In this paper we shall deal with the statistical problem of classifying an in¬ 
dividual into one of the populations xi and X 2 on the basis of his test scores and 
on the basis of past experience, given in the form of two samples, one drawn 
from xi and the other from vz . In the next section we give a precise formulation 
of the statistical problem and state the assumptions we make about the popula¬ 
tions xi and X 2 . 

2. Statement of the problem. We consider two sets of p variates (a^i, • • • , Xp) 

and (yi, • , 2/p). It is assumed that each of the sets (a:i, • • • , Xp) and 
( 2/1 > * • • 1 Vp) has a p-variate normal distribution and the two sets are inde¬ 
pendent of each other. It is furthermore assumed that the covariance matrix 
of the variates xi, • • • , is equal to the covariance matiix of the variates 
2/1 > • * * > 2/p > == {i> 3 = P)« We will denote this common 

covariance by cr*y. Let us denote the mean value of Xi by and the mean value 
of Vi by Vi . Furthermore we will denote the normal population with mean 
values Ml, * * * , Mp and covariance matrix |1 <r*y 1| by xi, and the normal popula¬ 
tion with mean values vi, • • • , Vp and covariance matrix |1 aij || by X 2 . 

A sample of size iVi is drawn from the population xj and a sample of size JVz is 

1 The author wishes to thank Dr. Irving Lorgc, Columbia University, for calling his 
jittention to this problem. 
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drawn from the population t 2 . Denote by Xia the a-th obaervation on Xi 
(i = 1, • • • , p; a = 1, • • • , Ni) and yifi the jS-th observation on 2 /» (t = 1, • • • , p; 
13 == 1, • • , Ni). Ijet Zi (i = 1, • • • , p) be a single observation on the i-th 
variate drawn from a p-variate population ir, where it is known a priori that ir is 
either identical with iri or with ira. The set ( 21 , * • • , Zp) is assumed to be dis¬ 
tributed independently of (xi, • • • , Xp) and ( 2 / 1 , • • ^ , 2/p)* 

We will deal here with the following statistical problem: On the basis of the 
observations Xia , Vip y Zi {i — 1, • • , p; a = 1, • • • , ATi ; = 1, • • • , N 2 ) we 

test the hypothesis Hi that the population tt, from which the set ( 21 , • • • , Zp) has 
been drawn, is equal to ti . The parameters , • • • , iip, I'l, • • • , Vp and 
II <r,y|| are assumed to be unknown. 


3. The statistic to be used for testing the hypothesis Hi . In this problem 
there exists only a single alternative hjq)othesis to the O-hypothesis Hi to be 
tested, i.e. the hypothesis H 2 that v is equal to Tra. If the parameters mi 1 * * * , 
Mp > *' 1 ) * * • > and 11 ffij II were known we could easily find (on the basis of a 
lemma by Neyman and Pearson) the critical region which is most powerful with 
respect to the alternative Ha. Let us assume for the moment that the para¬ 
meters Ml > * * * > Mp , J'l j * * • , J'p and II aij || are known and let us compute the 
critical region for testing Hi which is most powerful with respect to the alterna¬ 
tive H 2 . According to a lemma by Neyman and Pearson^ this critical region is 
given by the inequality 


( 1 ) 


2 ^2(2 1 , • * * y 2 p ) ^ , 
— ,^p) “ ^ 


where pi ( 21 , • • • , Zp) denotes the joint probability density function of 21 , • • • , 2 p 
under the hypothesis Hi, p 2 {zi , • • • , 2 p) denotes the joint probability density 
function of ( 21 , • • • , Zp) under the hypothesis H 2 , and k is a constant determined 
so that the critical region should have the required size. 

Denote the determinant value | <7,y | of the matrix || an || by Then 


1 -i S S 

(2) Pi(2i, • • • , 2p) = ® 


and 

1 -I S S 

(3) ps(«i , ,Zp) - (2x)prto- ® ’ 

where the matrix || || denotes the inverse matrix of the matrix |1 II • Tak¬ 

ing logarithms of both sizes of the inequality (1), we obtain the inequality 

(4) -MSS - •’0(2) - "j) - (2i - M<)(2i - Mi)]} > log k. 


* J. Netman and E. S. Pearson, ‘‘Contributions to the theory of testing statistical 
hypotheses,'^ Slot. Res. Mem., Vol. 1, London, 1936. 



A STATISTICAL PROBLEM 


147 


Multiplying both sides of (4) by 2, we have 

(6) 23 23 — M<)(*y — My) — — V{)(zj — i»y)] > 2 log fc. 

J * 


The critical region (5) is most powerful with respect to the alternative Ht , but 
it cannot be used for oiir purposes since the parameters mi , • * * , i J'l > * * * j J'p 
and II cTij II are unknown. The optimum estimate of crij on the basis of the ob¬ 
servations Xia and yifi is given by the sample covariance 


( 6 ) 


ATI N2 

Z (a:.-. - £iKxia - ^y) + Z (j/.fl “ J/.Oto - Vf) 

a-I 

Nr+Ni-2 . . 


^«a ViQ 

where Xi = “ -— and Vi = —. 7 — • The optimum estimates of m» and Vi are 

Ni N 2 

given by Xi and yi respectively (t = 1, • • • , p). Hence for testing Hi it seems 
reasonable to use the statistic R which we obtain from the left hand side of (6) 
by substituting the optimum estimates for the unknown parameters. Thus R is 
given by 


( 7 ) « = 23 2: S ^[(z% Xi)(Zj Xj) {Zi yi)i^j ij)]} 

3 t 


where || s’^l = || s.•ylr^ The critical region for testing Hi is given by the in¬ 
equality 

(8) R>Cy 


where C is a constant determined in such a way that the critical region should 
have the required size. It is interesting to notice that R is proportional to the 
difference Tl — T\ where Ti (i = 1, 2) denotes the generalized Student's ratio* 
for testing the hypothesis that the set ( 21 , • • • , Zp) is drawn from the population 
TTi . In our case the statistic Ti cannot be used for testing Hi , since Ti is appro¬ 
priate for this purpose if the class of alternative hypotheses contains all p-variate 
normal populations having the same covariance matrix as tti . In our case the 
class of alternatives consists merely of a single alternative, namely, the alterna¬ 
tive r 2 . 

For the sake of certain simplifications we shall propose the use of a statistic U 
which differs slightly from the statistic R. In order to obtain U, we consider the 
inequality (5). Since this inequality can be reduced to 

(9) Z Z - My) > k', 

i * 

where k' denotes a certain constant. The statistic U is obtained from the left 
hand side of (9) by substituting the optimum estimates for the unknown para- 


* See. in this connection H. Hotelling, ‘The generalization of Student’s ratio,” AnnaU 
of Math. Stai.y Vol. 2, and R. C. Bose and S. N. Roy, “The exact distribution of the Stu- 
dentized D* statistic,” Sankhya, Vol. 3. 
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meters. Thus 

(10) U = - £j), 

and the critical region is given by the inequality 

(11) U >d, 

where the constant d is chosen so that the critical region should have the re¬ 
quired size. The statistic U differs from R merely by a term which does not 
depend on the quantities zi, • • • , Zp. If Ni and N 2 are large the difference 
U — Ri& practically constant and therefore the critical regions (8) and (11) are 
identical. The use of U seems to be as justifiable as that of R and because of 
certain simplifications we propose the use of the critical region (11). 

The statistic U is closely connected with the so called discriminant function^ 
introduced by R. A. Fisher for discriminating between the two populations tti 
and ira. The discriminant function D is given by 

(12) D = hidi + bada -j- • • • -f- bpdp 

where d* = yi — x* and the coefficient 6* is proportional to ^ s'^dj . The co- 
efficients fei, • • • , bp are called the coefficients of the discriminant function. 
We see that U is proportional to the statistic ^ biZi which is obtained from the 
right hand side of (12) by substituting Zi for d<. 

4. Solution of the problem when Ni and N 2 are large. Denote by F(U, Ni , 

Ni I TTi) the cumulative probability distribution of U under the hypothesis that 
the set (zi, • • • , Zp) has been drawn from the population tt* (i = 1,2). If Ni 
and Ni approach infinity the distribution F({7, iVi, iVa | Ti) converges to a normal 
distribution, since the variates s»,-, Xi and yi converge stochastically to the con¬ 
stants Hi and Vi respectively (i, j = 1, • • •, p). Let us denote lim F(17, 

Niy Ni\ Vi) by I Vi) (t == 1, 2). Furthermore denote by a,- the mean value, 
and by ai the standard deviation of the distribution | Vi) (i = 1, 2). It is 
obvious that ai = ca = o’ (say). It is easy to verify that the variates 

(13) ai = SSs*^f<(j/y - ii), 

(14) aj = 218*^Myi - Si), 

8''‘s’‘{pk — Xk){Si — Xi)Sij 

, ^ <-l j-l *-l 1-1 

(16) 

= ^ ^ “ *t)(yj — Si), 

converge stochastically to the constants ai, aa and o’® respectively. 

♦ R. A. Fisher, statistical utilization of multiple measurements,*’ Annals of Eugen- 
ic 8 f 1938. 
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Hence for large values of Ni and Ni we can assume that U is nwmally ihs- 
tributed with mean value and standard deviation 9 if the hypothesis Hi (i ■> 
1 , 2) is true. Thus the critical region for testing Hi is given by the inequality 

(16) U >&i + \9, 

1 r* 

where the constant X is chosen in such a way that J ^ is equal to the 
required size of the critical region. 

Finally, some remarks about the proper choice of the size of the critical region 
may be of interest. Two kinds of error may be committed. Hi may be rejected 
when it is true, and Hi may be accepted when H 2 is true. Suppose that Wi and 
W 2 are two positive numbers expressing the importance of an error of the first 
kind and an error of the second kind respectively. If the purpose of the statisti¬ 
cal investigation is given it will usually be possible to determine the values of 
Wi and Tr 2 . We shall deal here with the question of determining the size of the 
critical region as a function of the weights Wi and W 2 . Denote by Pi the prob¬ 
ability that (16) holds under the assumption that Hi is true (i = 1, 2). Then 
Pi is the size of the critical region (also the probabiUty of an error of the first 
kind), and 1 — P 2 is the probability of an error of the second kind. Both prob¬ 
abilities Pi and P 2 are functions of X and are given by the following expressions: 

(17) 
and 


(18) 


V 2ir J (( 5 i—a2)/^)+X 


From (13) and (14) we obtain 

( 19 ) 0:2 ® '(^t “ ^.')(^i ^j)* 

i i 

Since the right hand side of (19) is positive definite, we have a* > oi. Hence 
because of (17) and (18) we also have P2 > Pi. By the risk of committing a 
certain error we understand.the probability of that error multiplied by its 
weight. Hence the risk of committing an error of the first kind is given by WiPi, 
and the risk of committing an error of the second kind is given by Tr»(l — P*). 
It seems reasonable ta choose the value of X so that the two risks become equal 
to each other, i.e. such that 

(20) WiPi = W 2 O. - P*). 


Hence using (17) and (18) we obtain the following equation in X 

y.(( 5 ,-a,)/.«+x 


( 21 ) 


1 /.« 1 .(( 5 ,-a,)/.«+x 
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Using a table of the normal distribution, the value of X which satisfies the equa¬ 
tion (21) can easily be found. For Wi — Wi the solution of (21) is given by 


X = 



and the critical region is given by the inequality 


U > ai -|- X# = fii -H 


ai -1- *2 
2 


6. Some results concerning the exact sampling distribution of the statistic U. 

If Ni and N 2 are not large the solution given in section 4 cannot be used and it 
is necessary to derive the exact sampling distribution of U. Let 


(22) (y.- a;,) ^^ - 2.- (t - 1, • • •, p). 

Then 

(23) ^ = /|/S E 

where the variates Zi, — • y Zp are distributed independently of the set ( 2:1 , • • • , 

Zp), the mean value of Zi is equal to (v* — /ui) A/ and the covariance 

Y A'l A/a 

between Zi and Zj is equal to aij. It is known that the set of covariances Sij is 
distributed independently of the set ( 21 , • • • , , zl, • • • , z'p) and therefore the 

distribution of U remains unchanged if instead of (6) we have 


(24) 


s.. = «r_L.r (n = + ATa - 2 ), 


where the variates Ua are distributed independently of the set (zi, — • , Zp, 
1 • * * > l^ve a joint normal distribution with mean values zero, cr^ = (t^ 
and = 0 if a /3, It is necessary to derive the distribution of U under 
both hypotheses Hi and i/a. In both cases the mean values of 21 , • • , , 

zj, • • • , Zp are not zero. Instead of U we will consider the statistic 

17 ' == 2 ^ s'^ZiZj 

«-i f-i 

which differs from U only in the proportionality factor a /. The distri- 

Y Af 1 Ar 2 

butions of U' under the hypotheses Hi and H 2 are contained as special cases in 
the distribution of the statistic 

(25) F = S S ^i.n+2 I 
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W' 


%liere is pven by (24) and the joint distribution of the variates 
(i x= 1, •. • , p; |8 = 1, • • • , n 4- 2) is giv^ by 


(26) 


1 p j» r * * 1 

_ t - e L«-i J 

(2T)*»(’H-a)/2^n+2 ^ 

14+2 


14+3 p 

xEZdh. 


/S-1 


The quantities $i, • • * , {p, »?i, • * * , ijp are constants and denotes the de- 
ternlinant value of the matrix || <r</||. 

We will deal here with the distribution of the statistic V given in (26) under 
the assumption that the joint distribution of the variates ii» {i — 1, • • • , p; 
jS = 1, • • • , n + 2) is given by (26). 

In order to derive the distribution of V we shall have to prove several lemmas. 

Lemma 1. Let \\ \ij || (t, j = 1, • • * , p) an arbitrary non-singvlar matrix^ 
and let 


ti0 


i hj tj0 


(t = L • • • , p; /3 = 1, • • • , n + 2). 


Let furthermore be given by 




tt—i 


n 


Then 23 ]C '^^^^t,n+l^i.n42 — n+it'j,n+ 2 , i.e. the stcUistic V is invariant 

i i i * 

under novrsingular linear transformations. 

Proof. We obviously have 


(27) 


^<.n+l tj,n+2 


-tt 


Xtfc ^st tk,n+\ t[,n+2 , 


Furthermore we have 

¥ 

(28) 


Sij — ^^ ■ 

M 1-1 


Hence 

(29) lls'yll = ||X,yll|hy||||Xiill 

where Xu = Xy<. 

From (29) we obtain 

(30) lls'^'ll = II IIII mix-II, 

and therefore 




( 31 ) 
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Hence from (27) and (31) we obtiun 

(32) S E<U+i </.»+» = E £ E E E E x''«“ ^* 1 . <».n+i <..»+*. 

j i i i k I u V 

The coefficient of ^,»+i^,n +2 on the ri^t hand side of (32) is given by 

(33) EE EE x'"x'^«*'x.„x^ - EE {(Ex“x<«)(Ex"x^,)«“} = g-. 

j i k I k I i j 

Lemma 1 follows from (32) and (33). ^ 

Lemma 2. The distribution of V remains unchanged if we assume that the co- 
variance matrix || 1| is equal to the unit matrix^ i.e. the joint distribution of the 
variates ti^ (i ^ 1, • • • , p; = 1, • • • , n + 2) is given by 


(34) 


1 -sF 2 S do+S «<,n+l-P*)*+S (<i,n+ 2 *-r<)* I 

-g «-i * »• J, 

(2ir)P^"+2^^2 ^ ’ 


where the constants pi and f» are functions of the constants fi, • * • , fp , , • • • , 

and of the an . 

Lemma 2 is an immediate consequence of Lemma 1. Hence we have to derive 
the distribution of V under the assumption that the variates have the joint 
distribution given in (34). 

Let Ui (i = 1, • • , p) be the point of the n + 2 dimensional Cartesian space 
with the coordinates , • • • , ^,n+ 2 . Let P — (wi, * • • , Wn+a) and Q = 

n+2 

(vi, • * *, Vn-\. 2 ) be two arbitrary points such that X/ ~ 0 and = 1- 

^-1 

Denote by 0 the origin of the coordinate system and let ?i,n+i be the projection 
of the vector ORi on the vector OP, We have 

n-f-2 

(35) ~ X) tifiUfi (i = ly * * * > p)« 

s -1 

Similarly, the projection ti,n ^2 of the vector 0P< on OQ is given by 

n-f2 

(36) ti,n^2 ~ X3 i'ifi ' 

S -1 

Let Ri {i ^ L * * * > p) be the projection of the point Ri on the n«dimensionai 
hyperplane through 0 and perpendicular to the vectors OP and OQ. Denote 
the coordinates of Ri by rn ,•••., ri,n +2 respectively and let hj be defined by 

n+2 

E ri$rg, 

(37) 


Sij = 


If we rotate the coordinate system so that the (n + l)-axis coincides with OP 
and the (n + 2)-axis coincides with OQ, and if ?<i, • • • , ?<,n +2 denote the co¬ 
ordinates of Pi (t = 1, • • , p) referred to the new system, then we have 

(38) Sij = —• X) ^70 ~ "" Xi) > and 


n (j-i 

n+2 


n 

n+2 


E = E 

«-i 


( 39 ) 
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From (38) and (39) we obtain 


*•+2 


(40) 


““ &.n4-l?/.n+l — ?t,n+2?i,n+2 


% = 


4-1 


n 


We will now prove 


Lemma 3. Let V be defined by 

(41) .n+l i^i.ii+2 > 

i i 


where , h,n ^2 and §,*, are given by the formttlm (35), (36) and (40) reepec'- 
tivdy. Let furthermore the joint probability diatribuiion of the variates (i = 
1, - , p; I3 - 1, • • • , n + 2) be gwen by 


(42) 


1 

(2ir)*’^^+2)/2 ^ 


if P »+i 

-5 S S 








Then the distribution of V calculated under the assumption that the quantities 
, • * • , Wn 42 , , * • ’ , «^n +2 ore coustants and the joint probability distribution of 

the variates is given by (42), is the same as the distribution of V calculated under 
the assumption that the joint probability distribution of the variates Ufi is given by (34). 

Proof. If we rotate the coordinate system so that the {n + l)-axis coincides 
with OP and the (w + 2)-axis coincides with OQ, and if > * * * > denote the 
coordinates of Ri (t = 1, • • • , p) in the new system, then f.-.n+i and ?<,n 42 are 
given by the right hand sides of (35) and (36) respectively. Furthermore 

n 

tiatja 

s a-1 


Hence the distribution of F is certainly the same as that of V if the 
joint probability distribution of the variates Up {i = = 

1 , • *., n + 2) is given by the expression which we obtain from (34) by sub¬ 
stituting Ufi for tifi . Thus, in order to prove Lemma 3 we have merely to show 
that if the variates tip have the joint probability distribution (34), the variates 
Up have the joint probability distribution (42). Since the variates , • • • , ^.n +2 
are obtained by an orthogonal transformation of the variates ?<i, • • • , ?t,n+ 2 , 
it follows that the variates Up (i ^ 1, • • • , p; = 1, • • , n + 2) are inde¬ 
pendently and normally distributed with unit variances. We have 

n+2 

( 43 ) tip =» hpyliy 

7-1 

where \py is equal to the cosine of the angle between the fi-th axis of the original 
system and y-th axis of the new system. Since 

hp,n^i = Ue and hp.n^z = vp , 
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and since jB(fc>) == 0 for 7 =* 1, * • * , n, £?(?<,n+x) *= pi and « f <, it follows 

from (43) t^t 

(44) E{ti^) « . 

Hence Lemma 3 is proved. 

We will now prove 

Lemma 4. Let P he a point with the coordinates , • • • , Un +2 and Q a point 
with the coordinates Vi, • • • , Vn +2 such that ~ 0 and 2 ul = 2 vl = 1. Denote 
by Lp the flat space determined by the vectors 0i2i, • • * , ORp (Ri = (<<i, * • • , ^\n+j)) 
and let P be the projection of P on Lp and Q the projection of Q on Lp , Denote 
furthermore by dx the angle between the vectors OP and OP, by $[ the angle between 
OP and OO, by ^2 the angle between OQ and OQ, by 62 the angle between OQ and OP, 
and finally by Bz the angle between OP and OQ, Then the statistic V defined in (41) 


is equal to 





0 

Oi 

02 


bl 

Oil 

012 

(45) 

p- __^ 

012 

O22 



Oil 

O12 



012 

O22 

where 





(46) ai 


(47) 


Proof. If we rotate the coordinate system in such a way that the (n + 1)- 
axis coincides with OP and the (n + 2)-axis coincides with OQ, and if 
, ?t.n +2 are the coordinates of Ri in the new system, then 

m. . .VmI 

“ n 

According to Lemma 1 the statistic V is invariant under linear. transfoi:mation8 
of the variables . Hence is also invariant under linear transformations of 
the variables . Thus the value of V remains unchanged if the points 
Pi, • • • , Pp are replaced by arbitrary points Pj, • • • , Pp of Lp subject to the 
condition that the vectors OPI , • • • , OPp be linearly independent. Hence 
we may assume that the vectors OPs, • * • , OPp are perpendicular to each other 
and lie in the intersection of Lp with the n-dimensional flat space which goes 
through 0 and is perpendicular to OP and OQ. Furthermore we may assume 
that Pi = P and P 2 = Q. Then 0P» is perpendicular to OP, OQ, OPi and OP 2 
(i = 3, • • • , p). 


== cos $1 ; 


* e , ^ al - bl 


2 « 2 12 

___ cos $2 02 — 62 


and an == 


cos ^1 cos 02 cos ^8 — ai02 — b\b2 



A STATISTICAL PROBLEM 

The statistic V can obviously be written in the form: 

0 ?l.n+l • • • ?p,n4-l 
5ii • • • Sip 


jj?" _ ^ Spi 

Sii •• 


Because of our choice of the points /?i, * • • , iZp, we have 


~ ^i,n +2 — 0 


= 0 if i 9^ j 


(t — 3, ‘ , p;y = 1, 


0 

&.n+l 

k,n^l 1 

^1 

1 

5ll 

§12 

U,n+2 

Sit 

§22 


1 §11 §18 1 



OpI 

to 


&.n+l 

II 

t 4-2 = 


From (49) and (50) it follows that S<y = for i 7 ^ j except S 12 which is not neces¬ 
sarily zero. Hence F reduces to the expression 


F = - 


For any two points A And B denote the length of the vector AB by AB. Since 

WSii + (?l,n4l)^ + (?l,n42)* = OP^i ^§22 + (fe,n+l)^ + (?2,n+2)* = ^^12 + 

+ ti,n+2l2,n+2 = 0 P* 0 <? cos $2 y wc Can cssily verify that Su = an, 
§12 = ai2 and S22 = 022. Hence Lemma 4 is proved. 

The angles $[ and 62 can be expressed in terms of the angles 61 , $2 and ds , 
In order to show this, let us rotate the coordinate system so that the first p 
coordinates lie in the flat space Lp defined in I^emma 4. Let , • ■ • , Wn +2 be 
the coordinates of P and vj , • • • , t;n 42 the coordinates of Q referred to the new 
axes. Then, since OP = OQ == 1, we have 

-%/”'2 I TTTz /i' u[v[ + * •• + UpVp _ 


cos di = ui + 
cos $2 = + 


+ Wp; 


cos di = 


cos $2 ~ 


' ' I 

Mlt>i 4 • • 


4 UpV p 

• • 4 Vp* 
4 

• 4 uf 


cos 6 t = 


' ' I I ' ' 

UiVi 4 ••• 4 UpVp 


Vu? 4 • • • 4 UpVvi^ 4 • • • 4 fP 


Hence 


cos $[ = cos Bi cos $2 and cos 62 — cos $2 cos $ 2 . 
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Introducing the notations 

Ml « cos* ft , m 2 = cos* ft and m» = cos ft cos ft cos ft, 

we have 


ai = mi, 


02 = ms, 


On = 

and 


mi *— mi — mj 


n 


61 = 
di2 


ms, ft == ms; 
ms(l — mi — ms) 


oss 


ms — ms — ms 
. n 


Substituting the above values in (45) we obtain 


ms 


= —n 


ms — 1 + mi + wis — mims 

_c os ^1 cos ft cos ft _ 

cos* ^1 cos* ft cos* ft — sin* sin* * 


Hence, Lemma 4 can be written as 

Lend^a 4'. Let P be a point with the coordinates ui, • • , t/n +2 and Q a point 
with the coordinates ui, • • • , i;«+ 2 . Denote by Lp the flat space determined by the 
vectors ORi , • • • , ORp and let P be the projection of P on Lp and Q tho projection of 
Q on Lp . Denote furthermore by di the angle between OP and OP, by ft the angle 
between OQ and OO nnd by ft the angle between OP and OO. Then the statistic V 
defined in (41) is equcd to 


(45') 


^ _ cos ft cos ft cos ft __ 

~ cos* ft cos* ft cos* ft — sin* 0 i «n* ft * 


If P is a point of the (n + l)-axi8 and Q a point of the (n + 2)-axis, then F 
is identical with the statistic V given in (26). Hence we obtain the following 
Geometric interpretation of the statistic V defined in (25). If ft denotes the 
angle between the (n + l)-axi8 and the flat space Lp determined by the vectors 
Ofii, • • * , ORp , ft the angle between the (n + 2)-axis and the flat space Lp , 
and if ft denotes the angle between the projections of the last two coordinate 
axes on Lp , then the statistic V is equal to the right hand side of (45'). 

Denote by S the 2 n + 1-dimensional surface in the 2 n + 4-dimensional 
space of the variables ui, • • • , tin+ 2 , t^i, • • • , Vn -\.2 defined, by the following 
equations 


(52) 


n+2 n-»-2 

^-1 


n-f2 

X = 0 . 


denote by C the 2n -f l-dimensional volume of the surface i.e. 
(63) 


C = f^dS. 
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Now we will assume that wi, • • •, Un^i , vi, • • • , ^^42 are random variables 
and the joint probability distribution function is defined as follows: the point 
(wi, • • * > Wn+*, vi, • * • , t;» 42 ) is restricted to points of S and the probability 
density function of is defined by 


(54) 


C 


Hence for any subset A of S the probability of A is equal to the 2 n + l-dixnen- 
sional volume of A divided by the 2 n + 1-dimensional volume of S. It should 
be remarked that the probability density function (54) is identical with the 
probability density function we would obtain if we were to assume that 
lii, • • • , Un 42 , 1 ^ 1 , • • • , Vn +2 are independently, normally distributed with aero 
means and unit variances and calculate the conditional density fimction undei* 
the restriction that (ui, • • • , Un+ 2 , t'l, * • • , i^n+s) is a point of S. 

Lemma 5. The probability distribution of V defined in (41), cakvMed under 
the assumption that the joint probability density of the variables , • • • , ti» 42 , 
vi, • • • , Vn ^2 tifi (i = 1, • • • , p; jS = 1, • • • , n + 2) is yiven by the product of 
(54) and (42), is the same as the distribution of the staiistic V calculated under 
the assumption that the variables have the joint probability density function given 
in (34). 

Lemma 5 is an immediate consequence of lemma 3. 

Lemma 6 . Let Lp be an arbitrary pdimensional flat space in the n + 2 dimenr 
sional Cartesian spaccy and let Mp be the flat space determined by the first p co¬ 
ordinate axes. Assuming that the joint probability density function of u^, v^, 
tifi {i — 1, • • • , p; iS = 1, • • • , n + 2) is given by the product of (54) and (42), 
the conditional distribution of V calculated under the restriction that the points 
Riy • • • , Rp lie in Lp , is the same as the conditional distribution of V calculated 
under the restriction that the points jBi, • • • , Rp lie in Mp . The point Ri denotes 
the point with the coordinates , • • • , , 

Proof. Let P be the point with the coordinates , • • • , Wn 42 and let Q be 
the point with the coordinates t’l, • • • , r»+ 2 . Let us rotate the coordinate 
system so that the first p axes lie in the flat space Lp , Denote the coordinates 
of P in the new system by , • • • , Un^ 2 , those of Q by vi, • • • , Vn-^ 2 , and those 
of Ri by til y • • - j ti,n +2 {i = 1, • • * , p). Let S' be the surface defined by 

(65) Xuf = = 1 and = 0. 


It is clear that the surface S' is identical with the surface S defined in (52). It 
is furthermore clear that if the joint density function of Wi, • • • , Wn+ 2 , , * • * , 

Vn +2 is given by the joint density function of wi, • • • , Wn+ 2 , I'l, • * • , rn 42 
C 

dS' 

is the same, i.e. it is given by . It can readily be seen that for any given set 

of values , • • • , Un+ 2 , , • • • , Vn+t the conditional joint probability density 

of the variates t'i^ is given by the function obtained from (42) by substituting 
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t’ifi for tifi, for Ufi and Vf for v$, provided that for any given set of 
values ui, ••• , , vi, ••• , t)» 4 s the joint conditional distribution of the 

variates tin is given by (42). Hence, if the joint distribution of Wi, • • • , iin+t, 
«!,•••, Vn+t and tifi (i — 1 , • • • , p; — 1, • • • , n + 2) is given by the product 
of (54) and (42), the joint probability density function of the variates vi, Vf , 
tisd = 1, • • • , p; (3 = 1, • • • , n + 2) is obtained from that of W/j, V/j, ti^ by 
substituting S' for S and tifi for . 

According to Lemma 4', V can be expressed as a function of the angles Bi , 
$2 and 6 ^ defined in Lemma 4'. Each angle Ok (k = 1, 2, 3) can be expressed as 
a function of the variables It is obvious that the value of $h remains 

imchanged if we substitute iifi for Ufi , for up and for Vfi . Hence also the 
value of V remains imchanged if we substitute tifi for Uis , us for and for . 
Lemma 6 is a consequence of this fact and of the fact that the joint probability 
density of the variates ti ^, and Vfi is identical with that of the variates ti 0 , 
U 0 and Vfi . 

Lemma 7. Assuming that, the joint probability distribution of the variates 
Ufi , Vfi , tifi (i = 1, • • • , p; = 1, • • • , n + 2) is given by the product of (54) and 
(42), the conditional joint probability distribution of ui y • •• , , * * * , , 

calculated under the restriction that the points = (<ti, • • • , U^n+ 2 ) (^* = 1, • • • , p) 
lie in the flat space determined by the first p coordinate axes, is given by 

1 »+2 P 

~5 S S (PiUyHiPy)*,. V ,0 

e r-P+l <-l f\Ul , • • * , Un +2 ) > Vn^i) do 

(Sfi) ,11+2 p ' ” ' > 

r -n 2 S 

/ e Y-P+1 -I /(Ml , • • • , Un +2 yViy • • • , Vn-^ 2 ) dS 


where S denotes the surface defined in (52), and f(ui , • • • , Un+ 2 , vi, • • • , Vn^ 2 ) 
denotes the expected value of 


(57) 


rn 

r2i 


rip ‘ 
Ttp 




rpi • • • Tpp 


calculated under the assumption that the joint distribution of the variates Ufi is given 
by (42). 

Proof. Denote by Ri the projection of Ri on the flat space determined by 
the first p coordinate axes, i.e. Ri = (^i, * • • , 0, • • • , Q). Let h be the 

length of Ri , and let U be the distance of Ri from the flat space determined by 
the vectors OiZi, • • • , (i = 2, • • • , p). Then, as is known, 


(58) 


^1^2 • • • l{ = 



rn ••• ru 
r2i • • • r2i 


(i = 1, “ •, p). 


where Vki 


tktMa • 

tt-1 


Til • • • Tii 
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We introduce the new variables 


(59) 




hi 

li 


(i *s 1, - ,p;y = p + 1, • • • , n + 2). 


Then the joint probability density function of the variates uu, , Ua, t*y 
(i = 1, • • • , p; /3 = 1, • • • , n + 2, a = 1, • • • , p, 7 = p + 1, • • • , n + 2) 
is given by 


p *+I 


( 00 ) 


ili **• ^ 2 S ('*a'-P<Wa-f<Va)*+S S iVy)^ 

__ e Lj- 1 «-l »-l Y-P+1 

C(27r)p<»‘“^2)/2 


X (IHId^i-Xn !!(«?.) ds. 

i y 

Substituting zero for tty {i = 1 , • * • , p, 7 = P + * * • > ^ + 2 ) in (60), we 

obtain an expression which is proportional to the conditional joint probability 
density of the variates W/j, 03 = 1, • • • , n + 2 ; t = 1, • • • , p, a = 1, 

• * , p), calculated under the rcstriction that the points /?, (i = 1, • • • , p) 
fall in the flat space determined by the first p coordinate axes. Hence this 
conditional density function is given by 

. n + 2 p 

-2 S 2 iPinyHiVy)\ 

Ae •• • Ip) 


ir p p *1 

X e “Li-i -I d-s n n 


dtia 


where A denotes a constant. The conditional distribution of the variates 
Uff , V(i = 1, • ’ • , n + 2) is obtained from (61) by integrating it with respect 
to the variables ita (^ = 1, • • • , p; a = 1, * * * , p). Because of (58), we see that 
the resulting formula is identical With (56). Hence Lemma 7 is proved. 

Lemma 8. Lei rni = u\ + • • • + ; m 2 = v\ + • • • + «^p , 

niz == u\Vi + • * * + UpVp . 7/ the joint distribution of the variates Wi, • • • , Un^ 2 , 
, *: * , Vn +2 is given by (54), then the joint distribution of mi , m 2 , ms is given by 


— WI 2 ) 


X F,.+2_„(1 — ms)^„ 42 - 


— ms 


’(>/(l - mi)(l 


^ dnii dmt dmt 


vdiere B denotes a constant, 


1 


\ (1 - n 


,2-.(*-3)/2 


(03) /•’*«) = - '/,V and #*«) = - .. 

Proof. Let m[ = Wp+i + • • • + Mn 4 s. ”*2 = + • • • + *'.+ 2 , 
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/ __ tn9 , . 

ma «* Up+iVp^i + • • • + Wn+aVn +2 j \/ m T ^ ^ * Hrst we 

calculate the joint distribution of mi, m®, ffl| mj, mi, i5li under the assumption 
that ui, • • • , w «+2 , ri, * • • , «;n+a are normally independently distributed with 
Eero means and unit variances. This joint distribution is given by 

Fp(mi)Fj»(mt)^p(tfi8)Fn+a-p(mi)F^a^p(mi) 

(64) , / / / 

X ^n+a~p(»^8) dmi dm d^8 dmt dm 2 dnit . 


Hence the joint distribution of mi, m 2 , ma, mi, mi, mi is given by 

^ ^ ^ p(^)^P { - p(m2) 

Vmimamxma Wmima/ 


(65) 


X 4>n+2—p 


( ~- 7n~7 J dmi dm dm% dmi dmi dmi 
V mi ma/ 


Thie required conditional distribution of mi , m, mi& equal to the conditional 
distribution of mi, m 2 , ma obtained from the joint distribution (65) under the 
restrictions mi + mi = 1, m 2 + mi = 1 and ma + mi = 0. Hence if in (65) 
we substitute 1 — mi for mi, 1 — m 2 for mi and — ma for ma we obtain an ex¬ 
pression proportional to the conditional distribution of mi, m 2 , ma. This proves 
Lemma 8. 

Lemma 9. For any point (ui, • • • , Un^z , , * • * , Vn^z) of the mrfiwe S defined 

in (52) (h^ expected value of (57) {calculaled under the assumption that (42) is the 
joint disirtbulion of Ufi) is a function of mi m, ond m? only, where mi , m a/nd m$, 
are defined in Lemma 8, 

Proof. Let || || (a, jS == 1, • • • , p) be an orthogonal matrix such that 


( 66 ) 

and 

(67) 


where 


Let 


X 13 


Ufi 




=7=^-^ (/3 = 1, •••,?) 

+ Uj, 


\yi 


U0 -f- Xv^ 


v^, iVfi + \Vfff 


(|3 = 1, •••, p) 




(68) 



; ' A. '.'lit 

thm tiiee vax^tc^ (<« are isd^pend^tly uid tMnaaUy distrtt>uted^^^^;^^ 
TMaaQoes. Siam for any pdnt of S, EtiUm) P^a + i’<«a > we have berecore^ ^ 
(66), (67) and (88) ; 

EliUt) “0 (♦ “ 1. ' ■ • > P» 7 “ 3, 4, •' •, p), 

JS?(/fl) *“ <«i(/»i, mt , m»), 
and JS?(<«) «* v>ffl(mi, ro*, tbi). 

Hence liie jdnt distribution of the variates <<« (i = 1, • • • , p; a = 1, • • • , p) 
depends merely on Wi, ms and m» . Since Uj = ^ /<«(,•« «= ^ , the ex- 

a-*l «—l 

pression (57) can be expressed as a function of the variables t'ia • IJraoe the 
distribution of the expression (67) depends merely on the parameters m\ , m% i 
and ms. This proves Lemma 9. 

The main result of this section is the following 

Theorem. Let V he the statistic given in (26) and let the joint distribution of the 
variates t^ii - 1, • • • , p; jS « 1, • • • , n + 2) 6e given by (34). Then the prob- 
ability distribution of V is the same as the distribution of 


^mf - (1 -mi)(l -m*) 

where the joint distribution of mi, nh and m$ is equal to a constant multiple of the 
product of the following three factors: the expression (62), the exponential 
^KmiS^}-f- 2 w, 2 p<r<+»n,S{’ 5 ) expected value of 


(70) 


Tn 


Tpl 


rip 


rpp 


<n+ 2 -p )/2 




The expected value of (70) is ccdculaled under the assumption that the variates /<« 
are normally and independently distributed with unit variances and EiU^) — piUa + 

liVa (t = 1, • • • , p; a = 1, • • • , p) where w« = mi, =* m*and]^ = 

a—1 a—l a-al 

m*. The domain of the variables mi , mt and mt is given by the inequalities: 0 ^ 
mi < 1; 0 < m« < 1; —v^mim* < m» < y/m^nt. 

Proof. First we note that the expected value of (70) is a function of mi, 
m* and m» only. Let P be the point with the coordinates «!,•••, tAi 4 », and Q 
the point with the coordinates »i, • • • , v»+j. Assume that the points Ri =* 
(<<i, • • • , L.n+j) (i = 1, • • • , p) lie in the flat space determined by tiie first p 
coordinate axes. Assume furthermore that «ii>i + • • • + Wn+«rn+s = 0 and that 
the lengths of the vectors OP and OQ i«e equal to 1. Then 

cos ffi = Vu* -1- •••-!- Mp; cos ft =» Vr* + • ■ • ■¥ v\ 


and 


__ u tVi 4" • * * "f" apyp _ 

u 1 + • • * + wp Vi ‘+ • • • + 


cos $9 
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wherB denotes the angle between OP and the flat space Lp determined by the 
vectors ORi , ; • • , ORp ; 6s denotes the angle between OQ and Lp, and 6$ denotes 
the angle between the projections of OP and OQ on Lp . According to Lemma 
4' the statistic V defined in (41) is equal to 




cos Bi cos 6s cos 6s 

cos^ 6i cos^ 6t cos* 6s — sin* 6i sin* 6s 


^ — (1 — mi)(l — ms) 

where ^ 

(72) mi =* cos* + • • * + Wp , ms — cos* 6s — vl + • • • + v\ 

and Ms = cos 6i cos 6s cos 6s = UiVi + • • • + Uj^p . 

It follows from Lemmas 5 and 6 that the distribution of T is the same as the con¬ 
ditional distribution of V calculated under the assumption that the unconditional 
joint probability density of the variates and is given by the product of 
(54) and (42) and imder the restriction that the points P» (f = 1, • * • , p) fall 
in the flat space determined by the first p coordinate axes. Since 




S (piUy+fiVy)* ^ 


is a constant multiple of 

J(wiSpJ+2iiit2pif<+mt2Jf5) 


from Lemmas 7, 8 and 9 it follows readily that the joint conditional distribution 
of mi = til + • • • + ttp , m 2 = i;* -f • • * + Vp an^ mg = UrVi + ... + UjjUp is 
equal to a constant multiple of the product of (62), (73) and the expected value 
of 70. This proves our theorem. 


It can be shown that the variates mi, m 2 and mg are of the order - in the 

n 

probability sense. Hence 


m\ — (1 — mi)(l — m 2 ) 


nm3(l + c) 


where « is of the order >-. Hence we can say: even for moderately large n the die- 
n 

tribution of the atatistic V ia well approximated by the diatribvdion of nmg, where 
the joint diatribuiion of mi, m 2 and ms ia equal to a conatarU multiple of the product 
of (62), (73) and the expected value of (70). 

If n -f 2 — p is an even integer, the expected value of (70) is obviously an 
elementary function of mi, ms and mg. Hence, if n -f 2 — p is even, the 
joint distribution of mi, m 2 and mg is also an elementary function of mi, mg and 

mg. 

If the constants pi and f,- {i = 1, • • • , p) in formula (34) are equal to zero, 
the expected value of (70) is a constant and the joint distribution of mi, mg and 
mg is given by (62). 



ASYMPTOTIC DlSTRlBUtlON OF RUVS AlO) DOW' 

Bt J/WoLFOwnz 
Colimbia UnwerMty 

1. Mtroductkm. Let oi, oj, • • •, a» be any n unequal numbers and let 
8 — {hijht, ••• , K) he & random permutation of them, with* each permutatlim 

having the same probability, \duch is therefore Let R be the sequence of 

Hi 

signs (+ or —) of the differences hi+i — hi (i 1, 2, • • •, n — 1). Tlien R 
is abo a chance variable. A sequence of p successive + (—) signs not imme¬ 
diately preceded or followed by a + (—) sign is called a run up (down) of length p. 
The term “run” applies to both runs up and runs down. As an example, if 
S = (4 6 23 5), then in A » (-f —I- +) there are three runs, one up of length 
one, one down of length one, and one in) of length two. 

The purpose of this paper is to establbh several theorems about the limiting 
distribution^ of a class of functions of runs up and down. These results are 
applicable to certain techniques which have been employed in qu^ty control 
and the analysis of economic time series. They are also shown to apply to a 
large class of “runs.” 

t 

2. Joint distribution of runs of several lengdis. Let rp be the number of runs 
of length piaR and r'p the number of runs of length p or more in R, Then Vp 
and Tp are chance variables. The expectations Eirp) and E(jrp), the variances 
ff*(r,) and <r*(rp), and the covariances (r(rp,rp,) are given by Levene and Wolfo- 
witz [1]. They are all of the order n. Let 


Vp 


Tp - Eixp) 

Vn ’ 


,/ _ ^'p - ^(^p) 

Vp -7=— • 

V n 

Our first results are embodied in the following theorem; 

Thbobem 1. Let I be any non^negalive integer. The joint di^ribation of 
yt, • • • ,yi, y'a+i), approaches the normal distribidion as n —* ». 

We shall give the proof for the case 1. = 1, but it will easily be seen to be 
perfectly general. 

Let Xpi = 1 if the sign (+ or —) of h,+i — hi is the initial sign of a run of 
length p, and let Xpi » 0 otherwise. Let Wp{ = 1 if the sign of hi+i — hi is the 


* Part of the results of this paper was presented to the Institute of Mathematical Statis¬ 
tics and the American Mathematical Society at their joint meeting in New Brunswick, 
N. J., on September 13,1943. 
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initiai sign of a run of iongth p or more, and let Wpt = 0 otherwise. Let Xpp ^ 
Wpn * 0. Then 

n 

n = X *M. 

n 

rj = 2 «*« • 

Now write a = n*, /3 = n*, and consider the sequences 

h(^i)a+i, I *' * > 0* ** 1> * ’ * > 

(Strictly speaking, we should employ the largest int^r in a. Since what is 
meant is clear and since we are dealing with an asymptotic property, we shall 
onut this useless nicety.) Let Xp{ and Wpi have the same definitions relative to 
each of these sequences that and Wpi have relative to the sequence S. The 
accented and unaccented x’s and w's are not always Hie same, because the 
partitioning of the sequence S sometimes breaks up runs and creates others. 
Thus we might have = 1, but Xp„ always = 0. 

It is easy to see that there exists a positive number d such that 

n 

23 1 Xu - Xw 1 < d/S, 

» <-i 

23 I Wm - Wh I < d/s. 

If, therefore, we define 


Zl 


22 


[xxi — E{xii)] 

i-il 

Vn 

X} [«4< - E{W2i)] 

t-l_ 

y/n 


we have 


I - yi I < 


2^ 

y/ n 


I ' 'I ^ 2d^ 


and 


d0 

y/n 


0. 



Bmoe, if Uie ]bsi^t limitiiig dis^ribu^oa of si aod «s is nomal^ao is ilttt 
and y\. 

The duince variables 


ru * 2- 

/ 

Xu 



i'a 


W 

II 

Wu 




(i * 1.2, • • ^ 


have rile same joint distribution for all values of 3 . For x'u and wu,{(j—l)a< 
i < j«), depend only on the relative magnitude of the elments of the sequence 




»» 


not upon the particular values which the elements take, and all permutarions 
of the sequence have equal probability. Clearly rn and r'tj are independent, in 
the probability sense, of n,/ and rjy> (j f), because of the definitions at xu 
and wti. (However, ny and rsy are not independent, because x'u and w'u cannot 
both be 1.) From the results of [1] it follows that for sufficiently large n the 
absolute value of the correlation coefficient between riy and r’tj is less than a 
number smaller than 1. By the methods of [1] it can easily be shown that the 
ratio of the fourth order moments of Viy and r'a about their means to the square 
of the variance of either, is boimded for sufficiently large n. Hence by Lia- 
pounoff’s theorem (see, for example, Cramer [2], Uspensky [5]), Z\ and are 
jointly normally distributed in the limit. Hence so are y\ and y't and the theorem 
is proved. 


3. Generalization of Theorem^ 1. Examination of the proof of Thecffem 1 
shows that it rests on the following two properties of nms up and down: 

a) Partition of the sequence S into subsequences affects at most d run s in 
each sub-sequence, where d is a fixed positive number independent of n. 

b) -After partition the totals of runs of each length in any sub-sequence (the 
definition now relates to the subsequence) are independent in the probability 
sense of the totals of runs in any other subsequence, and satisfy some cimdition 
(such as the Liapounoff) sufficient to make the components of the sum of the 
vectors jointly normally distributed in the limit. 

Hence if we adopt other definitions of runs which meet conditions (a) and (b) 
above, the total numbers of each of these various kinds of runs will be in general 
jmnriy asymptotically normally distributed. For example, if Sp and be the 
numbers of runs up of length p and of length p or more, respectively, and if 
tp and t'p are riie same quantities referring to runs down, then, with I and k any 
positive integers, 

- S} , Ss , • • • , S| I 8 ( 1 + 1 ) , 




166 


J. WOtiFdWTP* 

are jointly asymptotically normally distributed. However, if i9 included 
in this set, since 

8l = «i + «j +•'• + «« + «(I+W 

and 

<1 = h + + ••• + <* + <(*+1) 

differ by at most one, the limiting distribution is degmerate, i.e., its covariance 
matrix is only semi-dehnite. 

As another example, if we define a bizarre run as, say, tbe occiurence of a run 
up of length 5, followed, 17 elements later, by a run down of length 14, iben the 
number of runs of this type is asymptotically normally distributed with expecta¬ 
tion and variance of order n. 


4. Additive functions of runs of all lengths. Combining the nxunbers of runs 
of all lengths greater than a given length generally involves a loss of information. 
The following theorem on additive functions of runs up and down may be of 
general interest and of utility in avoiding this undesirable situation. 

Theobem 2. Let f(i) be a function, defined for all positive integral values of i, 
which fulfills the following conditions: 
a) There exists a pair of positive integers, a and b, such that 


(4.1) 


f(a) . a 
f(b) b 


b) for any €i > 0 there exists a positive integer N{ti) such that, for all n > N{ei), 


(4.2) 


i—n—1 

E 

t—ATCei) 


!/(*) I ff(n) < tin 


wiiere n, of course, has the same meaning as in the preceding sections. Lei F{S)^ 
a function of the chance sequence S, be defined as follows: 

(4.3) ^’(S) =‘2/(*>.-. 

»-l 


Then the distribution 


F(S) - E[Fm 
■ ■ <r[>(5)] 


approaches the normal distribution as 


n CO. 

As an example, let f{i) = 1. Then F{S) = r(, whose limiting distribution 
is normal by Theorem 1. 

This theorem is the exact analogue of Theorem 2 of [3] and the proof of the 
latter carries over without diflScult changes except in one important respect. A 
difficulty in the proof of the theorem in [3] lay in proving Lemma 4, and this 
lemma has to be proved completely anew. We shall limit ourselves here to doing 
just that. Lemma 2 of Theorem 2 of [3], whose only role was to help in proving 
Lemma 4, has no analogue in our present problem, but all the others do. It 
wilLtherefore be sufficient if we prove the following: 




sunS'-^'wDOWN 

LsioiA. There «a»8t« a ccnMmi o > 0, «ucA that, f&r eM n a^Sbtkidlff tai^ 

(4.4) tf*tF(/S)] > c». 1; ^ 

Ccmdition (a) of the theorem is imposed simply in order Idiat ^ result be 
not trivial. For, if (a) does not h(dd, we have that 

/(») » (f (1). 

and 

F(5) =/(l)Str< 

* (n — 1)/(1) = a constant. 

Suppose that 

f{i) as ui + V, 

with u and v constants, and i; ^ 0. Then by Theorem 1 

F(S) = u(n — 1) + vr'i 
= tTi + n c<Histant 

is asymptotically normally distributed with variance of order n. ' Without loss 
of generality we may therefore assume that 

(4.6) /(t) ^ ui + V. 

From this it follows that there exists an integer A > 2 such that 

(4.6) f(A - 1) + f(A + 1) 2f(A). 

Our object is to prove that (r*[F(S)] is at least of order n. The basic idea of 
the proof will be to construct two sets, say Li and L », of sequences S, such that 
the (same) probability of each is not less than a positive lower bound independent 
of n, and such that there exists a one-to-one correspondence between the se¬ 
quences of Li and those of Lt so that, if is a member of Li and S* the cor¬ 
responding sequence in Lj , 

|F(Si) - F(S,)I > gV^, 

where gf is a positive constant independent of n. It is easy to see that such a 
construction would prove the lemma. 

We shall call the subsequence (hi , ki+i , • • • , A.+j^) of S, a run of type Ti 
or simply a run Ti (the notion will be used only for the proof of this lenuna) 
if the following conditions are fulfilled: 

(4.7) each of the s^s of (fci+i — hi) and (A,+a+i — A.+x) is the initial sign of a 
run of length A. 

(4.8) if i 1, the sign of (hi — /i<-i) is not the final sign of a run of length A. 

(4.9) if t -f 2A n, the sign of (ft,+ 8 . 1+1 — fti+8+) is not the inilial sign of a 
run of length A. 
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(4.10) after the transformation Hy which interchanges and hi^A $ bas 

operated on the run, the sign of — hi) is the initial sign of a run of 
length 4 — the sign of — hi^x), in the new ordering, is the 

initial sign of a run of length ^ + 1. 

Thus, with A — 2 and n = 7, if *S = (7145326), then i? = (— + H-h), 

and (1 4 5 3 2) is a run Ti, for after the transformation H has been applied we 

have (1 6 4 3 2) which gives (H-). The result of the operation i? on a 

run Ti will be called a run . 

The number r* of runs Ti and the number r** of runs Tt each have expected 
values and variances of order n, by considerations similar to those of [1]. Hence, 
for an arbitrarily small positive € there exists a positive constant q such that, for 
all n sufficiently large, the probability P {r* + r** > gn} of the set L* of sequences 
S which satisfy the relation in braces, is not less than 1 — €. 

The set L* can be divided into disjunct sets (families) as follows: Let 5(0) 
be any sequence 5 in L* which has no runs (any doubt about the existence 
of such sequences will be soon removed) and let r*[5(0)] — m. Hence m > qn. 
Operating with the transformation H on each of the m runs Ti of 5(0) we get a 
set 5(1) of m different sequences for each of which r* »= m — 1, r** = 1. Operat¬ 
ing again with H on each of the pairs of runs Ti of the sequence 5(0) we get a 
set 5(2) of (?) distinct sequences for each of which r* = m — 2, r** — 2, etc. 
The process stops with 5(m), which contains a single sequence, for which r* = 0, 
r** «= m. The set 5(t) contains (?) different sequences for each of which r* = 
m — i, r** == i. The union of the sets S(i) (i = 1, 2, • • • , m) will be called the 
family whose generator is 5(0). The sets S{i) are obviously disjunct. Any 
sequence 5 in L* belongs to one and only one family. For if we operate on all 
of its runs Ta with H (which is its own inverse), we obtain the generator of the 
family to which it belongs. This also proves the existence of sequences in L* 
for which r** = 0. 

Consider any family F whose generator is a sequence for which r* m > qn. 
It is easy to see that, when n is sufficiently large, the ratio of the total number of 
sequences 5 in the sets Li and L? , where 


and 


Lt = 




»-0 


S{i), 


Lt - E S(t), 

.•-J(iiH-VS) 


to the total number of sequences in F is greater than a fixed positive constant K'. 

We are now ready to construct Li and Lt ■ The set Li is the union of the sets 
L* of all the families in L*, and the set Lt is the union of the sets Li of all the 
families in L*. The probability of In and of Lg is therefore not leas 
than hK'{l — c). The one-to-one correspondence is effected as follows: The 

subset — i) of the set In oS any famOy is to correspond to the 



'■ V > jtoils-^ ' ■'■'“""I#:' 

^ *“ 0,1» % > • •, j 

family. Tlie iodividiud sequmees of eithw e£ the two oedMetB m^-' be tiuide 
to correspond to those of the other in any manner whatsoever. Any si^itenoe, 
in la and its ccarresponding sequence & in la thus differ on]^ in the aombas 
of runs T{ and Tt , but are identical in the nuntbera of all other nmsi ^ Iheyi 
differ in at least Vw runs. Hence, 

1 F{8i) - F(&) I > v^l W) - M - 1) - /(A + 1) I 

> V^\2m -fiA - 1) -/(A + 1)1 . 

This is the required result with 

g^Vql 2/(A) - /(A - 1) -/(A + 1) I. 

Hence the lemma and the theorem are proved. 

The remarks of section 3 also apply to Theorem 2. 

6. The distribution of long runs. Certain tests in use in quality control of 
manufactured products are based on the occurrence of long runs. Since the 
mean and variance of r,, for any fixed p, are of order n, it follows that the prci}- 
ability that r, 0 approaches 1 (with increasing n). In order to base a test 
on the occurrence of a run of length p in Itmg sequences it is therefore necessary 
to make p a function of n. This function must be a suitable one, because if p 
is, for example, of the order n, the probability that 0 approaches 1; p 
should, therefore, be neither too short nor too long. 

The following theorem will help give the answer to this problem: 

Theorem 3. Lei p vary wiOi n, ao that 

(P + 1)1 _ 1 
. n K 

with K a fixed positive number. Then 

lim P{r, = j] = e-« . U » 0, 1, 2, • * •) 

n-^so 

i.e., Tp has in the limit the Poisson distribution with mean 2K, 

The proof will consist in showing that the moments of Vp approach the moments 
of a Poisson distribution with mean 21iL as n —► oo. This is sufficient (v. Mises 
{4]). 

Let a:« = 1 if the sign of hi^i ~ hi is the initial sign of a run of length p, and 
Xi = 0 otherwise. The probability that Xi == 1 is, by [1], Section [4], 

^ for all i with afixed number of exceptions.* Write B * 

then 

_ P{aj, « 1} * B + 0(B), 

^ Since these exceptions (at the ends of the sequence S) have no effect on the asymptotic 
theory, they will henceforth be ignored. 
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-/D\ 

where the symbol o{P) means that lira = 0. Let (« ** 1,3, • • •, be 

JtS 

independmit chance variables with the same distribution: P{yi = 1} = 

n 

P{yi = Oj = 1 — JB, Then it is easy to see that Y = ^yi has in the limit the 

•■■1 

Poisson distribution with mean 2K and that its moments approach the moments 
of the same Poisson distribution. Hence it will be sufficient to show that in the 
limit Y and Vp have the same moments. 

If g, ai, as , • • • , a, and ii < it < • • • < t, are positive integers, we have that 


Eiyl'yV ••• y‘;) = ••• y,-,) 


(6.1) 

= 

and 

(6.2) 

Also 

0 < E(x1,^x‘i •.. X?/) 

(6.3) 

E{r‘p) = e\ 




After expansion of the right member of (6.3), we may replace, in accord with 
(6.2), each of the non-zero exponents of the by 1. The same operation on the 
terms of the expansion of the right member of 

(5.4) 

is valid in accord with (6.1). 

Let ii < i 2 < • • • < iq . In the expression 

(5.6) £(x,,x,,-..xO, 


let q be the “weight.’^ A subsequence of consecutive x^s in (6.5) (it may consist 
of a single x) which is such that the indices of two consecutive x’s differ by less 
than (p + 3), while the subsequence cannot be expanded on either side without 
violating this requirement, will be called a ^‘cycle.^^ Let c be the number of 
cycles in (5.6). By [1], Section 4, if Xi and xy are in different cycles, 
i.e., I i — i I > (p + 3), then x» and xy are independently distributed. If, 
therefore, g == c, we have that 


(5.6) 


E{Xi^Xi^ 


J-1 


If 9 > c = 1, we have, also from [1], Section 4, that 


( 6 . 7 ) E{xi^Xi^ • • • Xi^) < E{xi^Xi,) = o{B). 
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If 9 > c and if there are two indioes^ili the expreisidoii (S.5) 
than p, thm 



(6.8) E(xi^Xit • • • *<,) - 0. 


For Xi and zy cannot both initiate runs of length p\i\i — j \ <p. 

Let us now return to the expansicms of the right members of (6.3) and (6.4), 
in which the exponents have b^n replaced as described before. Let the weight 
and cycle definitions also apply to terms of the t}^ 


(6.9) 


E(Jfi^yi^ • • • Vu)- 


From (6.1) and ( 6 . 6 ) it fdlows that, in the limit, the contributions to E{r^,) 
and F(F‘) of the sums of those terms for which 9 — c, are fhe same. Let W and 
W' be the sums of all the remain^ terms in E(rp) and E(Y% respectively. If 
we can itiiow that 


( 6 . 10 ) 


lim IT = lim Tf' * 0 


we will have proven that 

(6.11) Iimi?(r‘p) *limS(F') 

and with it the theorem. 

Let B = 0[f(n)] mean, as usual, that | ^ | < Mf(n) for all n and a fixed ilf > 0. 
The number of terms in W with fixed 9 and c (c < q, by definition of IF') is 
Oin°p^~"). From (6.1) the value of the sum of all such terms is 0(B*n"p*~*). 
Now 

nB - 0(1) 

by the hypothesis of the theorem. From the definition of p, 


and hence 


p = o(n) 


pB = o(l). 

Therefore 

BVp*^ = (nB)‘(pB)*-‘ 

= 0 ( 1 ). 

Since 9 < f, there are only a fixed number of such sums. Hence lim W' ^ 0 . 

The number of terms in W with fixed 9 and c (c < 9 ) is 0(n‘p*~‘). Howevw, 
most of these are of the type in ( 6 . 8 ) and therefore vanish. Those which do not 
vanish are 0{n‘) in number. Since 9 > c we have by application of (6.7) that 
each term is o(B"). Hence the value of the sum of these terms is o(n'’B") = 
o(l). Since 9 < 1, there are a fixed number of such sums. Hence lim TF 0 . 
This proves ( 6 . 10 ) and with it the theorem. 
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. It is poBflibte to goieralize ra^t In a maimer siinilar to that of Section 3. 

The author is obliged to W. Allen Wallis who first drew his attenticm to prob¬ 
lems in runs up and down, and to Howard Levene, who read the manuscript of 
this paper. 
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STATISTICAL ANALYSIS Of <^TABI TYffS OF RANOOlf 

FUNCTKNTS 

By H. Huhwitz, Je. and M. Kac 
ComdL VnmrsUy 

1. iatroducSon. In solving certain physical problems (Brownian movements,; 
shot effect) one is often led to the study of superpodtitHis of rand<Hn pulses. 
More precisely, one is led to sums of the type 

( 1 ) Fit) = £/« - (,), 

where N and the t/s are random variables and a function Pit) is given such tiiat 
/ Pit) dt represents the average number of pulses occurring during the time 
interval A. 

We propose to give a fairly detailed treatment of those statistical properties 
of Fit) wldch may be of interest to a physicist and at the same time pay careful 
attention to the mathematical assumptions which underly the applications. It 
may also be pointed out that our results could be applied to the theory of time 
series. 

2. Statistical assumptions and the distribution of AT. The statistical assump¬ 
tions can be formulated as follows: 

1 . The ij8 form an infinite sequence of independent identically distributed 
random variables each having pit) as its probability density. 

2. AT is capable of assuming the values 0,1,2,3, ■ ■ ■ only, and AT is independent 
of the t/a. 

3. If M(A; AT) denotes the nximber of those t/a among the first JV, which fall 
within the interval A, then for non-overlaping intervals Ai and Ai the ran¬ 
dom variables M(Ai; N) and MiAt ; N) are independent. 

We now state our first theorem.* 

Theorem 1. Assumptions 1, 2, 3 imply ikat*N is distributed according to 
Poisson’s law, i.e. 

Prob (AT = rj = 

•+» 

where ^ Pit) d<. 

> For a different approach to Poisson's distribution see W. Fbllbb, Math. Ann. 118 
(1937) in particular pp. 113-lflO. 
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Our proof is based on considerations oi characteristic functi(His. Let 
be 1 if 2 belongs to the intervid A and 0 otiierwise. Thus 

M(A:N) ^ 

From the independence of M(Ai; N) and Af(As; N) it follows that for every 
pair of real numbers £ and v we have^ 

F|^exp ^ 

= E [^exp g 1^-4 ,E ^exp jtij g , 

where E[x] denotes the mathematical expectation, or mean value, of x. Letting 
g(r) == Prob [N = r} and using first the independence of iV and thet/s and then 
the fact that the i/s are independent and identically distributed we obtain 


( 2 ) 


t qirmexp {m^W + #4,(0)}])' 

r-0 


= E g(r)(E[exp E g(r)(Elexp {ivMO}])'. 

f—0 r—0 

All easy calculation gives 

F[exp {i{^4,(0}l = 1 + (e’* - 1) f P(0 dl, 

• •’41 

Flexp {iij\;'48(0}] = 1 + (e’" - 1) f p(0 dt, 

JA2 

E[exp {i(|^4i(0 + #4s(<))l] = 1 + (c‘^ “ 1) f P(0 dt + (e'" - 1) f p(t) dt. 

J Ai •'^2 


The last equation follows from the fact that Ai and As do not overlap. Putting 

( = tj = r, x = l —21 p(t) (U,y = 1 — 2 f p(Jt) dt, ip{x) = Zg{r)x' we see that 
J Ai •'A2 

(2) yields the functional equation 

(3) <pix + y - 1) = <p{x)ip{y). 


One cannot ascertain that (3) holds for all real x and y. First of all the defining 
power series of ip{x) is not known to converge outside the unit circle and secondly 
it is not obvious that each pair of real numbers x^ y betsi^^een —1 and 1 is such 
that non-overlapping intervals Ai, Ai exist for which 


X == 1 



and 



dt. 


* We use the symbol R and E[H] interchangeably to denote the average (matlieinatical 
expectation) of JR. ^ 
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'B.awevet, if odb restricts oneself to small Ai and At the fanotional equation 0 
is sem to hdd in a suffidentiy nnall nd^borhood of 1. This is st^Skaeat '|hi 
view ci the analytidty of in the unit cirde) to determine ^(x). 

In fact, differentiating (3) first with respect to x and ‘then with reelect to y 
we get 

<p'(xW(j/) « <p'\x + - 1 ). 

Letting y » i and putting v>'(l) = h we have 

v''ix) = V(*), 


which yields inunediately 

ip(x) - Ae’“ + B. 

An entirely elementary reasoning (which employs the fact that Ae’“ + B must 
satisfy (3)) leads to the conclusion that B = 0, A = which in turn implies 
at once that 


g(r) = e"* 


hr 

rl* 


Finally, 

Pit) dt = ElMiN; A)] = S [g 1 ^a(A)] 


= (/^ ® ^ ^ A 


and therefore 


[" Pit) dt-- h, Pit) = hpit). 

J-OO 

Since h is the mean value of N (i.e. i^) we shall use instead of h. 


3. Fourier coefiSicients of Fit) and their statistical properties. In physical 
applications it is often convenient to assume that the “pulse function” fit) is 
periodic with period TiT large) and one might therefore restrict oneself to tire 
interval (0, T). 

■ It is furthermore assumed that both fit) and Pit) are suffidentiy smooth* so 
as to justify the formal operations on Fourier series performed below. Since 
we work in the interval (0, T) we assume that Pit) = 0 for < < 0 and ( > T. 

Expanding fit) in a Fourier series in (0, T) we get 

fit) ~ 2 «(«*) exp •<01. = ^, 


* For instance/(O P(t) may be assumed to be of bounded variation. Actually, much 
less severe restrictions suffice but in investigations of this sort far reaching generality would 
only impair the exposition. • 
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m a(fak)b(fah) exp 


&(«*) = £ exp (—iukti)- 


Note that 

^exp (—»«<)] = exp (—»wt)p(<) dt 

and put 

jo +• +• 

£?[^’(0] = -7^ 2 o(«jt)p(«*) exp (iukt) = T 23 a(w*)p(«*) exp (iwtO. 

P\U) —00 —00 


c(w) — is(<a) 




2 e®8 (fjtkij) ~ iVc(«*) 


n 


(if) _ /-I 


Vn 

2 sin («*ty) - i9^«(w») 


VW 


Thus remembering that ^ ^ P(f) ^ ^e may write 


^ exp (iw»0 


or 


&\ ~ VT f: a(«*)(Xi^> - *Tf>) exp 

Vp(0) — 

We can now state the following: 

Theorem 2. In the limii ae ft —* <» each X»^ (ond Yh*^) ienormally diebrih- 
vied vnth mean 0 and variance ^ + ^(2w») — |e(2(i>i)). 

The proof, as usual, is based on the consideration of the characteristic function 

ofXf> 
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We have ; ? , 

= exp c(«*)} exp (-J?) g ^ ^JEf [exp^ j ^ 

* exp {-• t{V^ c(«i)} exp (—ff) exp |^exp . i 

In deriving this formula use has been made of the facts that the t/a axe inde¬ 
pendent and identically distributed, that N is ind^iendait of the t/a and that 
iV^ is distributed according to Pdsson’s law. It is now e^ to see that as J9'-> « 
the characteristic function of approaches 

exp {-(i + ic(2w»))f*} 

uniformly in every finite f-interval. Ihis, in view of the continuity theory 
for Fourier-Stieljes transforms, implies our theorem. It should be mentioned 
that it is tacitly assumed that even thou^ N TpiO) approaches « it does it 
in such a way that the ratio p(w)//i>(0) (and hence e(a)) remains constant (or 
more generally, approaches a limit). 

By considering the characteristic function of the joint distribution of 
and Xt^\ I A: 1 9 ^ 1 i |) (or any other pair like, for instance, Xi^ and F{^’, in 
which case no restriction on k, Z is necessary) we are able to prove 

Teceobeh 3. In the limit as -* 00 ffie disUnd Fourier coeffieimls (F(t) — 
F[F{t)])/'\/ p(0) are normally correlated ({.e. their joint distribuHon function is the 
bivariate normal distribution). 

It is also clear that the higher correlations (i.e. between more than two coef¬ 
ficients) will lead to multivariate normal distributions with coefficients expresisdble 
in terms of Fourier coefficients of* Pit). 

We do not state Theorem 3 in more definite terms because in the next section 
we shall give a more convenient and useful way of handling correlation proper¬ 
ties of our Fourier coefficients. 

4. Statistical structure of Fourier coefficimits. Let us assume that Pit) > 
y > 0 and that the Fourier series of Pit) converges eveiywhwe. 

Expandmg \/P(Z) in a Fouri^' series in (0, T) we have 

so 

VPit) = ]C v(»i) exp (twit), 

—«0 

and in particular (since pit) = Pit)/lf) 

y/piti) = g rim) exp (twity). 
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We caa now write 

Hm) “ S exp (-*w*ty) *= 2 ***l/r^*~ 


i-i 




y/piti) 
exp (i(ci)i — 

Vp^ / 

exp (twt</) l 
>/?(«/) J 


’{■s^S 


exp (wj^) 


Vp(^) "4 


— ^ Wjb)<r(—Wi) + Wfc) 

<0 

= r a(o)i + «*)a(—«j) 

I*—«0 

+ V? ,£.(«+«*){vff. § - V?.-(-«.)}. 

Put or(o>) =s oe(6>) + ij8(w), note that by Parseval's relation 

eo 

p(cJk) = ^52 + w«)<r(—wi), 

and introduce random variables and by means of the formulas 

rr(^) 1 ^ /Tf. ( X 

yf'- - y/f sw>. 


VJtU V'piti) 


Thus 


6(«t) = TpM + VT i, «^U + 

I—<0 

and we have the following theorem. 

TEmoBBM 4. In the limit as N » the random variables Ui^\ Ui^\ 

U%^\ Vi^, •--are independent and normaUy disbribvied {each with mean 0 and 
variance ^). 

This the<nrem can be proved in a manner exactly analogous to that of Theorem 
2 . We need only consider the characteristic functions of the joint distributions 
of U’b and T’s and treat them in the same way as we treated the characteristic 
function of the distributicm of a single X in the proof of Theorem 2. One thing, 
however, should be strongly emphasized. The proof of independence (in the 
limit as oo) of and (111?^ I»»1), for instance, depends on prov¬ 
ing that 



STATISTICAL ANALTUS 07 RANOOU fONCTlOKS 1^9 : 


Ilus in turn d^imds essentisUy tiie fact that AT is dlsti&uted aooordii^ 
Poisson’s law. 

In fact, •* 








K A COS (at ^ )\/a cos _ 

#-l Vp(ti) /\^1 Vpitf) /] 


Ta(vt)u(um). 


But 


E 


r /A cos {uiti)\ 

/A cos (w «<,)\1 

Lv« vmr 

Vp(<#) /J 


= e\± + ^r£ 

Li-1 piw J 


COS CJI ^ cos <am 

V^)Vr^) J 


and finally 

Eiuf^uL^^] = - i)r«(a„)«(«,). 

Since for Poisson’s distribution f}* = N + (R)* we get 

E[Uf^ = 0. 

Also the proof that E[ | I ^ employs essentially the fact that N is 
distributed according to Poisson’s law. 

In view of Theorem 4 we can restate Theorem 3 in a form which is both useful 
and illuminating inasmuch as it describes completely the statistical structure 
of the 6 (wt)’s and hence of the Fourier coefiicients of F(jt). 

Theorem 5. For the purposes of finding correlations between the bimYs it 
sufiices to repkux each 6 (w*) {in the limit as R —* ») by its"staHslical representor 
tion” 


Tp{uh) + Vr ]C »■(«* + wMi. 


where A-i is the complex conjugate ofAi,Ao,Ai,At, • ■ • a sequence of independent 
complex-valued random variables and each Ai, is distributed in such a way that 
$k = arg is uniformly distributed independent of Ak and the density of the prob¬ 
ability distribution of | At | is 

2Ae~^\ (A > 0 ). 

Theorem 5 was proved under the assumption P(<) > 7 > 0. This assumptioii 
was needed to validate the convenient artifice of multipl 3 dng and dividing by 

Vi^- 

However, even in the case when P{t) is not bounded from below by a positive 
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number (it is always true that Pit) > 0) Theorem 5 remains true. It could be 
proved by direct but tedious ccmsiderations suggested in section 2. 

Tlreorems 4 and 5 can be easily extended to the case whar the pulses all have 
the same shape but may, at random, differ in magnitude. In other words, 
instead of sum (1) we may consider the sum 

(4) 

1-1 

where the individual pulses are independent and a function P(€, t) is given such 
that 



t+At 


Pit, t) dt dt 


is the average number of pulses of ‘‘amplitude” between t and « + Ac occurring 
between t and ^ + A^. 

Theorems 4 and 5 still hold provided one replaces the Fourier coefficients of 
Pit) by those of 

and the Fourier coefficients of y/Pit) by those of 


6 . Concluding remarks and summary. If one assumes that the number of 
pulses N in the time interval (0, T) is constant instead of being a random variable 
obeying Poisson^s law, then Theorems 4 and 5 fail. The failure is due to the 
fact that, for instance E[Ui^^Uj^^] is no longer 0. However, as T « the 
changes in correlation due to assuming N constant become negligible. On the 
other hand if one assumes that the number of pulses in each of the time intervals 
(0, r), (r, 2r), - is fixed, the changes in correlations become appreciable. 
This case can also be treated by the above methods. 

The case in which p{t) is independent of time has been considered in various 
connections by Schottky, Uhlenbeck and Goudsmidt' and Rice^ Their investi¬ 
gations emphasized the importance and usefulness .of the harmonic analysis 
of random functions. 

In conclusion we summarize our results for the case of time-dependent P(€, t) 

* W. Schottky, Ann. d. Phys. 67 (1919) pp. 641-567. 

G. E. Uhlenbeck and S. Goudsmidt, Phys. Rev. 34 (1929) pp. 145-151. 

S. O. Rice, mimeographed notes on mathematical analysis of random noise, as yet 
unpublished. 

The authors are indebted to Mr. Rice for making his notes available to them. 
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by observing that in applioations <»ie may replace Fit) by its “Statu^wl 
representation” 

(6) B[F(0] + \/r 2 o(wt)| S ff(«i + wt)^j|exp (tw*0. 

fc — eo «0 ) 


where 


2rk 

- -y. 


^F(<)] = 2’ £ aim)piuk) exp (iukt), 

—•0 

««e so 

/ «P(«, 0 “ Z) p(«*) exp iiukt), 

•t-ee ib " so 

“ *5. 

and the ilj's are normally distributed complex-valued random variables for which 

E[Ai] = Q, PfU, 1*1=1, = 

Furthermore, for I > 0 the ill’s are statistically independent. 

Thus 


Fit) - ~ ^ \im) exp M 

where the X’s are normally distributed complex-valued random variables obeying 
the relation 

^[|X(«)r]=r |o(«)|’j["Q{0d<. 


If Q(t) is periodic with frequency ^ then it follows that X(<tf') and X(w") are 

2t 

independent unless «' -I- u" or (/-•«" is an integral multiple of ci»»,. 

Finally, we mention that Fit) — ^F(0] is normtmy distributed with variance 
.s(0 given by the formula 

«*(<) = MiFit) - .B(F(<)))*] = r .£ yMf*M exp imt), 

where yiwk) is the Fourier coefficient of Q(t) and piuk) the Fourier coefficient 
of fit). ■ 



RANDOM ALMS 

By Paul R. Halmos 
Syracum University 


1* Statement of the problem. Consider the problem of distributing one 
pound of gold dust at random among a countably infinite set of beggars. Let the 
beggars be enumerated and let the procedure for distribution be as follows: 
the first beggar is given a random portion of the gold; the second beggar gets a 
random portion of the remainder; • • • and so on ad infinitum. In this descrip¬ 
tion the phrase random portion^’ occurs an infinite number of times: it seems 
reasonable to require that it have the same interpretation each time. To be 
precise: let x, (j == 0, 1, 2, • • •) be the amount received by the jth beggar. Let 
the distribution of Xo be given by a density function p(X): 


( 1 ) 

( 2 ) 

(3) 


p(x) ^0, 0 g X g 1; 

j[‘p(X)dX = ]; 


P(a < xo < 



0 g a < 6 g 1. 


After the first beggar has received his alms and the amount of gold dust left is 
(i.e. Xb = 1 — m)> the value of Xi will be between 0 and y. The uniformity 
requirement mentioned above means that the proportion of y that the second 
beggar is-to receive is again determined by the probability density p: in other 
words the conditional probability that Xi be between \y and (X + d\)y, given 
that Xo = 1 — Mj is p(X) dX. In symbols: 


(4) 


P(ay < Xi < by \xo — I 



dX. 


Writing a ay, ^ by, (4) becomes 

(6) P(a <Xi</8|xo = l— m) = J dX. 

More generally I shall assume that the conditional probability distribution of 
Xn, assuming that after the preceding donations there is left an amount y, is 

given in the interval (0, y) by ^p . In symbols: 

(6) P(a < Xn < 61 ]C 1 “ m) ~ f ~pf“^dX, 0 g a < b g /i. 
i<n Ja y \y/ 

This assumption completely determines (in terms of p) the joint distribution 
of the whole infinite sequence {xo, xi, X 2 , •••}. Several interesting special 
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questions a^y be'esked about this dis1xibuti<HU< For eoqunple: What are 
expectation, dispersi<ni, and higher moments of the Xn ? Wbat, simUarly, are 
the monients of the partial mun 8n Mme geaierally what aie .the 

exact distributions of in and of 8n ? Will the process described really distribute 
all the gold, or is there a positive probability that some is left even after every 
beggar had his turn? WWt is the rate of convergence of the series ? 

It is the purpose of this paper to answer these and a few related questions. 


2. Calculation of distributions. The n + 1 dimensional probability density 
of the distribution of (**>, a:i ,•••,*«) is given by* 


(7) 





in the region defined by ^ 0, Xo + • • • + Xn g 1. For n * 0 there is only 
one term in the product and that one is equal to p(Xo); the region is defined by 
0 ^ Xo ^ 1. The formula reduces in this case to the definition of the distribution 
of Xo . The general case follows inductively by the use of the conditional prob¬ 
ability formula (6). ^For example; P(xo == Xo, Xi = Xi) = P(xo = \o)P(xi « 

Xx I ^6 = Xo) = p(Xo) P (j^) .) 

From (7) it is possible in principle to calculate the densities of the distributions 
of x„ and of Sn. Thus for example the density g„ of the distribution of x„ 
is found by integrating out the Xy with j < n from (7), so that 


( 8 ) 


- / ■ ■ • / iP. 1 - z,<. X, ” (i - z;,<. X,) ’ 


where the integration is extended over the region defined by Xy ^ 0 (0 ^ i g n), 
^ 1- Similarly 7„(<) == ,P(5n < 0 is given by 


(9) s 1 -£,«x,’’(i -E,<.x;)'®“" ‘^-’ 


(0 ^ ^ 1) where the domain of integration is defined by Xy S 0 (0 g j g »), 

Working with integrals of the type (8) and (9) is often greatly facilitated by the 
substitution Xy, (X< = /« ~ 0 ^ i ^ n. The Jacobian of this 

linear change of variables is identically one. The domain of integration used 
in (9) is defined in terms of the p’s by 0 ^ mo ^ m ^ ^ ^ f ^ 1, so that 



^ A sumraation or a product extended over an empty set of indices will, as is customary, 
be interpreted as 0 or 1 respectively. Since throughout this paper only non-negative indices 
are considered, whenever the notation indicate a negative index the quantity to which it is 
attached is to be interpreted as 0. 
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Hence the density of the distribution of S» is 

«»n(0 = [ djttn-l / dun-i [ d/iO IJ ^- P (j - 

•h j» J$ i<n 1 — m-1 \1 - fH-l/ 

_1 

1 — \1 — fln-l/ 

For later purposes it is more convenient to set < = m» in (H) and to express 
t'n(Mn) as a multiple (and not as an iterated) integral; then 

(12) vM = f • • • / n r— P dpo • • • dun-i, 

J J iSnl — Mi-l \1 — M<-1/ 

where the domain of integration is defined by 0 g ^ Mi ^ ^ /*n-i ^ 

Mn ^ 1. The integrals (8) and (12) are explicitly evaluated below for a special 
case. 

It is possible from (8) to find the ith moment of Xn, Af*"’ = 

jf Xt3n(X»)dXn. Write 

” X)*p(X) dX. 

Clearly Af*"’ is obtained from (8) upon multiplication by Xl and integration 
with respect to Xn. 

(13) = / ■.. / X* n p • 

It is advantageous once again to write m = Xy. The resulting integral 
may be written in the iterated form as follows: 

Mi"’ = f d(M> [' dMi • • • f dMn 11 r -— 

Jq Jfio •'mr-i A — 

(1^) y V 

Consider separately the innermost integral 

T f « 1\ / \k d/in 

J = / pi - I -- . 

\ 1 - Mn-1 / 1 - Mn~l 

Writing X = (//n — Mn-i)/(l — Mn~i) this becomes 

J — f p(X)X*(l — Pn-O* dX = ajb(l — Mn-i)*'. 

JO 

Hence 

(15) M*"’ — oik [ diiD ■“ f dftn-i n ^— - - p(t -^Y(l “ i“«-i)** 

Jo •'Mn-a <<»» t ““ M»--l \1 M/-l/ 
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The innermost integral this time is 


•'Mis-j \ a — Mn-t / 1 — Mn-a 

Write X = (m»-i - /»»- 2 )/(l - then (1 - * (1 - X)(l - m.-») and 

J' = j(* P(X)(1 - x)*(l - Mn-«)*dX = A(1 - 
Hence, finally, 

= afcjSjt f dflo ••• [ dfJLn^i IX 

(16) 

1 - /1.-/V1 - 

Observe now that the right member of (16) (except for the factor 0k) may be 
obtained from (15) upon replacing n by n ~ 1 . In other words = 0kMh*^'^^\ 
Since = ak , it follows that 

(17) n = 0, 1, 2, 

Instead of calculating similarly the moments / /in v»ii*n) d/i„ of S» it is more 

Jo 

convenient to calculate the quantities 

Wi-’ = j[' (1 - Mn)^n(/ln) d/ln . 

The moments themselves may be obtained from the N^& by simple combinatorial 
formulas. 

It follows from (12) that * 


(18) Nl”^ = f d/lo f djU, • . . f d/l„ n P (1 - Mn)*. 

•'o •'no •'n.-i .Sn 1 — Mi-I \1 — /H_i/ ' 

The innermost integral in (18) is 

r ^fp (I - /i„)*J^-^. 

•'n,-i \l — Pn-l/ |1 — Mn-1 

Writing X = (/In - /in-i)/(l - Mn-i), (1 - /In) bccomcs (1 - X){1 - /In-i), SO that 

j" = j[’pW(l - M*(l - /«n-l)*dX = dtd - /In-l)*. 

Consequently • 

i I n r :.-- - P(t ""*~* )-( 1 - A«n-i)* 

Jo Jm «-2 *<n 1 — Mi-I \1 — M»~l/ 


(19) 
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BO that ' ; 

(20) = /sr*, n * 0, 1, 2, •.. . 

The additivity of the first moment yields an amusing check on (17) and (20). 
Since E!{Sn) = == X)y^ni?(^i) (where E denotes expectation, or 

first moment), it should be true that 1 — iVr^ = In terms of a's 

and this means 1 — = ai , and this in turn reduces to the trivial 

identity ai = 1 — j&i. 

Since 0 g xj ^ 1 with probability 1 for every n, it is clear that the series 
converges with probability 1 to a sum x, 0 ^ a; g 1. Since E{xj) = 
ai^i and since E(x) = ^j^oEix^), it follows that E(x) = = ai/ 

(1 — j3i) = L This implies (since 0 ^ x g 1) that x must be equal to 1 with 
probability 1. In other words it is almost certain that all the gold dust will 
eventually be distributed. 

3. Product representation. Considerable light is shed on some of the above 
computations (and in fact the moment formulas (17) and (20) are proved anew) 
by the following considerations. The principle of equitable treatment enun¬ 
ciated in the introductory paragraph was subsequently formalized by the condi¬ 
tional probability relation (6). It may also be formalized by the following 
(equivalent) procedure. I^et j/o, Viy ^ 2 , * • • be a sequence of independent 
chance variables each of whose distributions is given by the probability density p; 
let yn be interpreted as the proportion, of the amount available to the nth beggar, 
that he actually receives. In other words 

(21) x„ = 2 /„(l - 23 *»). w = 0, 1, 2, • • • . 

y<n 

The first main problem in this formulation is to express the x^s in terms of the 
y^8. This is most easily accomplished by an inductive proof of the formula 

( 22 ) 23 a:, = 1 - n (1 - Vi)- 

j^n j^n 

For n = 0, (22) asserts merely that xo = 2/o • The inductive step proceeds as 
follows: 

23 


« = y» n (1 - Vi) 
y<» 


= Xn+ Xy = 2/n(l - 2 ^y) + H 

y s n-i y %. y s i 

= Vn n (1 - j/») +1 “ n (1 - Vi ) ■ 

i s n-l i % n-1 

= 1 - (1 - y„) n (1 - j/i) = 1 - n (1 - y/)- 

• is n~l iSn 


From (22) it follows that 
(23) 


X, 





(24) . i?» « 1 - |S« ■« 1 - ]C »i “ n (X - Vi)- 

/S'* ■ /s« 

The moment formulas (17) and (20) follow immediately from (^) and (24) 
respectively. , 

Another very important application of (23) and (24) is the following theorem. 
If the first geometric moment (geometric mean) 


r = exp {S(log [1 — j/,])} = exp < / log (1 — X)p(X) dX 


is different from zero 


(l.e. If { 


log (1 — X)p(X) dX is finite) then the limits 


lim (Xn/Vn)^'" and lim R\ 


both exist and are both equal to r. 

Since according to (23) and (24), Xn/y„ = Rn-i the two parts of the conclusion 
are seen to be equivalent. For the proof take the Ic^arithm of both sides of (24) 
and divide by n, obtaining 


log ft*'* « I E/gn log (1 - Vi). 


Since, according to the hypotheses stated, the chance variables log (1 — yy) 
are independent and all have the same distribution with a finite expectation, 
the strong law of large numbers applies to the right side of (25) and (after 
taking exponentials) yields the desired conclusion. 

The result just obtained may be phrased as follows: with probability 1 Xn 
is asymptotically equal to r'^yn . This statement shows that in an obvious if 
somewhat crude sense the rate of convergence of xy is that (at least) of a 
geometric series with ratio r. This conclusion is further supported by the 
behavior of Rn , which again is the sort of thing one expects from a geometric 
series. (That is: the nth root of the nth remainder of a geometric series always 
does converge to the common ratio.) As usual, more delicate quantitative 
results concerning the rate of conveigence may be obtained by applying to 
(26) not merely the law of large numbers but the law of the iterated logarithm. 

The product representation of Zn in formula (23) points the way to a generaliza¬ 
tion of this theory which may be of some interest. In this generalization Zn 
is still defined by (23) and the y^s are still independent, but the distribution of y^ 
is given by a density p/, where the p’s need not be equal to each other. In 
terms of random alms this means that the condition of equitable treatment is 
replaced by the following weaker condition: the probability distribution of the 
amount that the jth beggar receives depends only on j and on the amount left 
by the preceding beggars, and in particular does not depend on the sizes of the 
alms already distributed. Many of the conclusions obtained under the simpler 
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hypotheses carry over to this generalized case with only slight changes. In 
particular the distribution formulas (7), (8), and (12), and the moment formulas 
(17) and (20), are changed only to the extent of acquiring an extra subscript 
due to the difference of the p, . 

4* Applications. (A) The onginal motivation of the present work was an 
investigation of the notion of a random mass distribution, and the results ob¬ 
tained may be considered as one possible solution of the problem of defining 
randomness for mass distributions in the special (discrete) case where the entire 
mass is concentrated on the non-negative integers. It would be of gi*eat interest 
to extend the results of this note to various continuous cases in which the set of 
integers is replaced by the unit interval, or the entire real line, or n dimensional 
Euclidean space. I intend to study some of these extensions at another time; 
at thq moment I merely mention one implication of this statistical point of view. 

Considering the sequence {xo, , 0 : 2 , * • •} as a system of weights, the integer 

n carrying the weight Xn , various questions may be raised concerning properties 
of the discrete mass distributions so obtained. For example: do the moments 
rrik = exist and, if so, what are their averages and dispersions and, 

more generally, their moments and their distributions? I shall settle here the 
questions concerning existence and expectation. 

The chance variable Mk is non-negative and, even it if is infinite with positive 
probability, its expectation is defined by E(mk) == ^n^on^E(Xn) = 
= X)n>on*ai)3r. Since 0 < ffi < I, the last written series con¬ 
verges and therefore E{w^k) is finite. This implies that rrik is finite with prob¬ 
ability 1. 

(B) It has been observed that the logarithms of the sizes of particles such as 
mineral grains are frequently normally distributed. Kolmogoroff* has given 
an explanation of this phenomenon; the results of the present paper yield an 
alternative and in some respects simpler explanation. Suppose in fact that the 
probability of a particle losing a chip the proportion of whose size to the size 
of the original particle is between X and X -|- dX is p(X) dX. With this stochastic 
scheme the size of the remaining particle after n chips have been lost is given by 
/2n . Since, by (25), log Rn is a sum of independent chance variables with the 
same distributions, the Laplace-Liapounoff theorem may be invoked to show 
that the distribution of Rn is for large n nearly normal. (It is necessary of course 
to assume here the finiteness of the second geometric moment, or equivalently 

of the integral [w (1 — X)p(X) dX.) The mean and th^ variance of each- 

summand of log Rn are 

a = log (1 — X)p(X) dX and (1 — X) — af p(X) dX, 

* A. N. Kolmogoroff, “Ueber das logarithmisch normale Verteilungsgesetz der Dimen- 
sionen der Teilchen bei Zersiueckelung,” C. R. (Doklady) Acad. Sci, URSS (N. S.) Vol. 
31(1941), pp. 99-101. 
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ree^iectivdy; coosequently (by the additivity of the mem and the vaa^ee) 
the eom^KHiu^iig panua^^ of the distribolion of log (and henee of ,the< 
approximating normal distnbQtipn) are. given by (n + l}a and (n -f 1)&* re¬ 
spectively. • 

(C) A special case of the distributicms studied in this paper (namely ^e case 
of uniform distribution, p(X) » 1) arises in the theory of scattering of neutnms 
by protons of the same mass. According to Bethe*: “In each coUision with a 
proton the neutron will lose energy. As l<mg as the neutron is fast compared 
to the proton, the probability that the neutron energy lies between E and E + dE 
after the collidon, is w{E) dE = dE/Et , where Et is the neutron energy before 
the collision. This means that any value of the final energy of the neutron, 
between 0 and the initial energy Et , is equally probable.” 

To calculate explicitly the distributions it is most convenient to start from 
(11). If p (with any argument) is replaced by 1 and the terms of the product 
are distributed, each under its own differential, (11) takes the form 


( 26 ) 



r dm 

’0 1 — Mo' 


The value of the iterated integral is easy to obtain: Vn(i) — (—l)"(l/n!) 
log" (1 — <). Since Vn(i) gives the distribution of the partial sum Sn , the distri¬ 
bution of = 1 — /Sn is given by i>„(l — 0 = (- l)"(l/»!) log" i* It is possible 
but not necessary to derive similarly the distribution of x„. It is simpler to 
obtain this distribution by exploiting the symmetry of the uniform distribution. 
Since, according to (23) and (24), x„ and R„ are both products of n -|- 1 uniformly 
and independently distributed chance variables they have the same distribution, 
so that the density of the distribution of is also given by ( —l)”(l/n!) log" t, 
n = 0, 1, 2, • • • . 

The roles of the geometric mean r (= 1/e in case p = 1) and of the normal 
distribution have also been observed in the physical situation. Fermi* has 
expressed the geometric series like behavior of by the statement “ • • • 

an impact of a neutron against a proton reduces, on the average, the neutron 
energy by a factor 1/e,” and Bethe* remarks that “ • • • the actual values of log E 
after n collisions form very nearly a Gaussian distribution • • •” 


’ H. A. Bethe, “Nuclear Physios, B. Nuclear Dynamics, Theoretical,” Reviews of Modern 
Physics, Vol. 9(1937) p. 120. 

‘This distribution has been calculated by E. U. Condon and G. Breit, “The energy 
distribution of neutrons slowed by elastic impacts,” Physical Review, Vol. 49(1936} pp. 
229-231. 

* Quoted by CondoQ and Breit, loc.cit. 

• hoc, cit. m 



ON BIASES IN ESTIMATION DUE TO THE USB (^ PRELIMINARY 
TESTS OF SIONmCANCE 

By T. a. Bancroft 
Iowa Stale College 

I. INTRODUCTION 

In problems of statistical estimation we often express the joint frequency 
distribution of the sample observations Xi, xs, • • • x« in the f(mn 

( 1 ) M, ••• ,a;n;«,/?,% --OndXi, (i« 1 , ••• ,n) 

where the functional form, /, is assumed known, and a, fi,y, • * • are certain popu¬ 
lation parameters whose values may or may not be known. Given this specifica¬ 
tion, statistical theory provides routine mathematical processes for obtaining 
estimates of the parameters a, /S, 7 , * * • from the observations ari, 0 ^ 2 , • • • , . 

In performing tests of significance we often assume that the data follow some 
distribution 

( 2 ) fi{xi , • • • , Xh ; a, iS, 7 , • • on da;,- , (f =* l, ... , n) 

where fi is a known function or family of functions. We may wish to test the 
hypothesis that the data follow the more specialized distribution 

(3) ) • • • , Xn J ot, ^ , 7 > * * * )1I dxi , (i = 1, • • • , w) 

where fi is some member or sub-group of the family /i . Given this specification, 
statistical theory provides routine mathematical processes for testing such 
hypotheses. 

In the application of statistical theory to specific data, there is often some 
uncertainty about the appropriate specifications in equations ( 1 ), ( 2 ) and ( 3 ). 
In such cases preliminary tests of significance have been used, in practice, as 
an aid in choosing a specification. We shall give several examples from the 
literature of statistical methodology. 

( 1 ) In an analysis of variance, in order to obtain a best estimate of variance, 
we may be uncertain as to whether two mean squares in the lines of the analysis 
may be assumed homogeneous, [ 1 ]. Suppose that it is desired to estimate the 
variance al , of which an unbiased estimate si is available. In addition, there 
is an unbiased estimate si of al , where from the nature of the data it is known 
that either al — A or al < a\ , As a criterion in making a decision the following 
rule of procedure is used frequently: test si/si by the F-test, where si and si 
are the two mean squares. If F is not significant at some Jlsigned significance 
level use (jhsl + 7 ^ 282 )/(ni + Ui) as the estimate of A . If F is significant at the 
assigned significance level, use si as the estimate of A . 

(2) After working out the regression of y on a number of independent variates 
we may be uncertain as to the appropriateness of the retention of some one of 
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tlw indepeodmit variates, {2]. To iUujgtrate let us oonisder the choice 
the ngv^tai equations y » htxi + hi«i and y' « hia^, alter havMl fittcid 
y » + hia^ ; the poinilation r^ressioa equarion being y » ^idsi •¥ <~ 

In this case a procedure commonly used in deciding whether to retain is as 
fdlows: we test «i/st by the F-test, where 4 is tiie reduction kt sum of squares 
due to xt after fitting Zi, and «* is the residual mean square. If F is not aignifi-- 
cant at some assigned s^snificance level we omit the term containing xt and use 
bi as the estimate of j9i. If F is significant we retain the term containing in 
and use bi as the estimate of j9i. A similar example occurs in fitting a poly> 
nomial, when there is uncertainty as to the appropriate degree for the poly> 
nomial [3]. 

(3) In certain analyses we may be uncertain as to the appropriateness of the 
use of the x* test. Bartlett gives an illustration in a dismission of binomial 
variation, [4]. He performs two supplementary x* tests of significance as an aid 
in deciding to abandon the main use of the x* test altogether, and proceeds to use 
an analysis of variance instead. It is of interest to note that the main use of the 
X* test gives a significant difference at the 6% level while, in the analysis of 
variance, Fisher’s z is not significant at the 6% level. Here again we might 
formulate a “rule of procedure” and follow through the analysis as in the pre¬ 
ceding cases. 

This use of tests of significance as an aid in determining an appropriate speci¬ 
fication, and hence the form that the completed analysis shall take, involves 
acting as if the null hypothesis is false in those cases in which it is refuted at some 
assigned significance level, and, on the other hand, acting as if the null hypothesis 
is true in those cases in which we fail to refute it at the assigned significance level. 
An investigation of the consequences of some of these uses is the purpose of this 
paper. 

It is proposed to consider the first two cases mentioned above: (1) a test of tiie 
homogeneity of variances, and (2) a test of a regression coefficient. A complete 
investigation of the consequences of the rules of procedure would be very extmi- 
sive, since these consequences depend on the form of the subsequent statistical 
analysis. As a beginning, it is proposed to limit the study to the efficiency of 
these “rules of procedure” in the control of bias. 

The need for solutions of a whole family of problems of this kind has been 
pointed out recently by Berkson [6]. 

II. EXAMPLE ONE: TEST OF HOMOGENEITY OF VARIANCES 

1. Statement of the ptoUeni. 8^ and 4 are two independent estimates of 
variances v\ and 4 respectively, (such that ni4/4, n««l/4 are distributed inde- 
pendently according to xi mid x«, with ni and n% degrees of freedom). It is 
known t^t a\ < a\. To obtfun from these an estimate of 4, to be used in the 
particular analysis in hand, we formulate a rule of procedure. 

2. Rule of procedure. Test 4/4 by the F-test. If F is non-significant at 
some assigned cdgnificance level, we use (ni4 + niSi)/(ni + ni) as the estimate 
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of ffi. If F is s^ficant at some assigned significance levd we use as Hie 
estimate of tri. The estimate of o-‘ obtained by this rule of procedure mil be 
denoted by e*. 

S. Object of this mvestig^tion. If we follow such a rule of procedure, what, 
vill be the bias in our estimate e* of <ri ? 


4. Derivation of the expected value of e*. First we wish to find 


p/niSi + nsSjN 

m + n* )’ 


if < X, 

82 


where X is the value on the F-distribution corresponding to some assigned sig¬ 
nificance level for ni and n2 degrees of freedom. 

Let vi = , V2 = sL Since and si are independently distributed, the joint 

distribution of vi and V2 is 


^exp 



dvi du 2 , 


where ci is a constant and ui and rh are the respective degrees of freedom. 
Let us make the transformation of variables 


ui = niVi + n2t;2, 0 < < « 

U2 0 < U 2 < \ f 

V 2 


then the expected value, Ei , of 


Ui 


ni + ^2 


for 1^2 < X is given by 


(ni 


+ “ P5iVx) i i S' 




(ni«* + 






dU 2 


where P{u2 < X) is the probability of being less than X. 

Integrating out Ui and expressing the result in terms of the incomplete beta 
function we obtain 

( 4 ) + n2)Ei = + n2lxji(hni ,^n2 + 1)^1 

P{U 2 < X) 

where Xo = (ni^X)/(?i2 + ni^X), <p = <r2/<ri. 

We wish now to find the expected value of si when Si/s2 > X. Again we start 
with the joint distribution of vi and V2 , given above and this time let 



Vl - Vi , 



then tlte expected value, Et , of vi when F S | is 

1 
X 

Integrating out vi as a gamma function, and expressing the results as incomldete 
beta functions we obtain 


p(Fg 


JT exp + ^jdPtdF. 


( 6 ) 


Bi _ [1 “ Ixidni + 1 , iin})]<ri 

P(Y < 1/X) 


where 


xo = nivX/(nj + ni^X) as before. 


6. Final Results. The probability that we use (ni«i + nj«*)/(ni + nj) is 
P(ttj < X). From equation (4) the contribution from this case to the mean 
value of e* is 


nihodni H- 1 , in})<ri + 1 )<^« 

ni + n* 

The probability that we use s? is P(F < 1/X). From equation (6) the contribu¬ 
tion from this case is 

[1 — /*,(Jni 1 , . 


The expected value of e* is obtained by combining the two cases, i.e., 

(6) E(e*) = fl -I-X— > iw* + 1) T “ • 

1_ ni + n* ffi )J 

Hence the bias in e*, expressed as a fraction of v* is 

( 7 ) —X— , in* + 1 ) T + 1 > i”»)l 

ni + n* L ffi . J 


We note that in estimating o-? there will be a positive bias, no bias, or a n^ative 
bias according as 


J«o(ini, jna + 1 ) 
if*o(i”'i + i”*) 

is greater than, equal to, or less than a\/<r \. 

6. Identity and checks. If a\ = <rl, then in section 4, Ei = <ri and 
P(«* < X) = /„(ini, in,). 
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From (4) ttda gives the identity 

(«! + , in*) =■ + 1| in*) + n»7»,(ini, }n* + 1) 

wha% Xo as niX/(nt + niX). This identity no^y be established easily by ele¬ 
mentary calculus. 

The first result in equation (6) may be checked by noting that when \ ^ 
i.e. when the two mean squares are always pooled, Xo is 1 and equation (6) 
reduces to (nicr* -f n*ff*)/(ni -|- nj). Similarly when X » 0, in which case there 
is no pooling, xg = 0, and equation (6) reduces to vl. 


7. Discussion. In making a choice of an appropriate estimate of we may 
consider three procedures: 

(1) Use 8* always. This has the merit of having no bias, but is likely to have 
a large sampling error. 

2 , 2 

(2) Always pool, i.e., use —^ ; —- . When al 9 ^ al this is biased, but in 

ni + fii 


compensation will have less sampling error than (1) since it will be based on 
(ni + 712 ) degrees of freedom. 

2 

(3) Use the test of significance of ~ as a criterion in making the decision 

82 


as to whether to pool the two mean squares or not. If the test discriminates 
properly between cases where pooling should or should not be made, the pre¬ 
liminary test of significance criterion will utilize the extra n 2 degrees of freedom 
whenever permissible and also avoid the bias in method (2). 

In Table I the expected value E(e*) divided by al , is given for two sets of 
values of ni, 712 somewhat typical of those frequently encountered in applied 
work, and for a series of values of <r 2 /<ri. In addition to the case of always pool¬ 
ing (X == 00 ) and that of never pooling (X = 0), the results for X at the 1 percent, 
6 percent, 20 percent levels and for X = 1 have l)een tabulated. By subtracting 
unity from the results the bias is obtained as a fraction of <ri. The Table was 
computed from the incomplete beta function Tables [6]. 

When the two mean squares are always pooled, the fractional bias is negative 
and increases numerically as <rl becomes small relative to a\ . By examination 
of the values in Table I for <rl/crl = .1, it will be seen that the preliminary test 
of significance controls the bias well when al is much smaller than <7? , that is 
when a large bias from pooling is most to be feared. This result happens be¬ 
cause in such cases the preliminary test allows pooling only in a small propor¬ 
tion of samples. 

If X is taken at the 1 or 6 percent levels, the maximum bias appears to occur 
when al/<n is in the region 0.4-0.6, there being little bias when crl is near al . 
The lower values of X (20 percent or X equals 1) control the bias satisfactorily 
in the region <rl < .fieri, but have a fairly substantial positive bias when erj = <r!, 
that is when pooling would actually be justified. By use of the relation between 
the incomplete beta function and the sum of the terms of a binomial series it 



caa showB t)iat k aiways a positive biM when v! » vr and fiiir 

given numbers of degrees of freedom this bias is greats when X » 1. 

To sununarize from the example in T^le I, it seems that fm smidl values of 
th and nt none the values of X which have been investi^ted controls the bias 
throughout the whole range 0 ;< <r|/<ri 1. 

TABLE I 

Expected Value of e*/<r\: E(e*)/a\ 


Case 1: tii i- 4, ni, ■ 20 



8. The variance at e*. Using the same method we may obtain the variance 
of e*. The final result is 

ni(ni + 2)J,,(ini + 2, \n^o\ + 2ninjI,,(Jni + 1, in* + l)vio'| 

_ 4~ Titjnt + 2)Z„(^ni, ^n» -{■ 2)<rj 

(wi + n*)* 

+ 11 - + 2 , in,)]«rl - [l + --5-r 

111 L + «* 

> i«» + 1) p “■ + 1, (Ti. 


( 8 ) 
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1^ 

From the relations in deri\dng this result the following identity was obtain^: 
(wi + n 2 + 2)(ni + n 2 )/»o(iwi, ^^ 2 ) = ni(ni + 2)/*o(Jni + 2, 

+ 2nin2/«o(iwi + 1, |n2 + 1) + + 2)/,^(ini, + 2). 

This identity can be readily established by elementary calculus. 

As a check on the result in equation (8), we note that if X — 00 , then Xo = 1, 

and 4 < X always. The variance of the estimate of variance becomes 

2(ni<ri + 7 h(ri)/(ni + 112 )^, which checks with the variance of (njs? + thsl)/ 
(ni + 112 ) for the case of always pooling. If in addition iri = 0 * 2 , then 
V = 2(r\/{ni 4“ W 2 ). If X = 0, then Xo == 0, and sl/sl ^ X always. The variance 
of the estimate of variance becomes 2 (Ti/ni which checks with the variance of s? 
for the case of never pooling. 

The expression for the variance of e* enables us to investigate how much has 
been gained in terms of reduction in variance by the use of the preliminary test. 
The quantity [V + (Bias)^} is the appropriate value for the whole sampling 
error, where V is given by (8) and the bias by (7). For the two numerical 
examples these quantities are shown as fractions of ai in Table II. 

As a standard of comparison the variances of the estimate si (no pooling) will 
be used. In these examples the preliminary test with X = 1 pnxiuces a variance 
smaller than that of for all values of aXjfrX except the lowest (0.1) where the 
two variances are equal. As X is taken successively higher there is a substantial 
reduction in variance when crl is near al but an increase in variance over that of 
81 when (rl/aX is small. Throughout nearly all the range of values of (nhX , 
the smallest variance is obtained by always pooling (X = 00 ), despite the rela¬ 
tively large bias given by that method. This result is a reflection of the in¬ 
stability of estimates of variance which are based on only a few degrees of 
freedom. 

III. EXAMPLE TWO: TEST OF A REGRESSION COEFFICIENT 

1. Regression and some properties of orthogonal functions. Let 

y = 0iXi + P 2 XH + e 

be a linear regression of y on the two variates Xi and X 2 in which and ^2 are the 
respective population regression coeflicients and e is the error. We assume that 
Xi , X 2 and y are measured from their respective sample means and that the values 
of Xi and X 2 are fixed from sample to sample. In order to make comparisons 
among samples of different sizes we assume that xi and X 2 have unit variances 
and correlation coefficient^ p so that 

* S{xl) = n - 1, 8 {xl) = n - 1, S{xxXi) = p(n ~ 1), 

^ Although p is commonly used to denote a population correlation coisfficient, we are 
using it here for the sample correlation coefficient between the fixed variates Xi and . 
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Case 1: ni ■• 4, tti « 20 


a’{/«i 



.1 

<2 

.3 

.4 

.5 

.6 

.7 

.8 

.9 

i.o 

X *=00 

.577 

.462 

.360 

.275 

.205 

.149 

.111 

.087 

.076 

.084 

X.oi *= 4.43 

.560 

.620 

.603 

.523 

.350 

.323 

.243 

.184 

.160 

.137 

X.06 = 2,87 

.514 

.545 

.564 

.528 

.479 

.414 

.353 

.299 

.260 

.237 

X .20 “ 0.00 

.601 

.500 

.493 

.480 

.458 

.435 

.408 

.374 

.367 

.360 

X =1 

.499 

.493 

.480 

.462 

.441 

.423 

.401 

.389 

.381 

.387 

il 

o 

.500 

.500 

.500 

.500 

.500 

.500 

.600 

.500 

.500 

.500 


Case 2:ni » 12, ni ^ 10 





.1 

.2 

.3 

.4 

.5 

.6 

.7 

.8 

.9 

1.0 

X =00 

.217 

.183 

.154 

.130 

.112 

.097 

.088 

.084 

.085 

.091 

X.ox = 4.71 

.185 

.218 

.203 

.171 

.141 

.118 

.103 

.094 

.092 

.096 

X.06 = 2.91 

.170 

.187 

.194 

.183 

.163 

.142 

.125 

.114 

.109 

.109 

X.?o = 0.00 

.167 

.169 

.171 

.170 

.164 

.156 

.146 

.139 

.136 

.135 

X = 1 

.167 

.166 

.165 

.162 

.158 

.162 

.148 

.144 

.144 

.147 

X =0 

.167 

.167 

.167 

.167 

.167 

.167 

.167 

.167 

.167 

.167 


where (S(a:i) denotes summation of x\ over the sample, with similar meanings 
for S{x\) and S{xiX 2 ), where n is the sample size. 

We make the orthogonal transformations 

^ = Xi, ^”*53“" P^l , 


then 

y = /3ift + + p{i) + c. 


But 

= n - 1, Sti = (n - 1)(1 - p\ -S(f,&) = 0, 

therefore 

= ft(n - 1) + ftp(n - 1) + -g(a5ie), 

and 

'S(yfe) = / 8 j(n — l)(l ~ p) + S(*2 ~ P»i)e. 
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Now if we represent the regresfflon coefficients of y on the (’s as B’s we have 
fiiStti) = S(hy), = S(6y). 

The reduction in the total sum of squares due to Xi ignoring xt is 

„ , [-S({iy)P [(n - l)(ft + Ap) + S(»,e)]* 

B,S(vh) - • 

The reduction in the total sum of squares due to Xt after fitting X\ is 
„ ,, I<S(fel/)? [(n - 1)A(1 - P») + 8{.xt - paQep 


The reduction in the total sum of squares due to regression is 

l(n - 1)(A + Ap) + -S(a:,e)]* , [Sc(®, - px,) + (n - 1)A(1 - p)f 

(n - 1)(1 - p*) 

in which the two parts are independently distributed. 

Let hi be the regression coefficient of y on Xt when the term containing ®2 is 
Omitted from the regression equation. Now, 

/ B((iy) (n — 1)(A + Ap) + S(xte) 

“ S(i!) “ « - 1 


Hence 

(9) E(b'i) =* A + Ap* 

Let hi be the regression coefficient of y on aj 2 if both xi and X 2 are used. Then 

V _ I? _ _ (w - 1)A(1 - P^) + S(xs - px,)e 

S(f?) ■ (n - 1)(1 - p*) 


And 


V(bi) = 


3(^2) 

[S& 


1 


1 

(n- 1)(1 -p»)' 


The normal equations for Y = biXi + hxi are 



biS{x\) + biSixiXi) = S{xiy), 
biSixiXt) + bjSixt) = S(xty). 

Now 

S(xiy) ^ ^ S(xix») 

Therefore 

ss: 61 + biPj 

or 

61 « 5i — 6 ip. 
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(10) E(bi) = fii + hp ~~ i>E(bt). 

We notice that if p 0, &i is unbiased in any selected portion the populhtion. 

2. Statement of die problem. To obtain an estimate of &i, in a particular 
analysis in hand, in which it is desirable to choose by memis of a test of si^hi'^ 
ficance between using the regression equation Y = hxi + htXt and F' «= , 

we formulate a rule of procedure. 

S. Rule of procedure. Calculate the following analysis of variance: 

Degrees of freedom Sum of squares Mean square 


Reduction due to xi 

j [(n — l)(ft + ftp) + S(xie)]* 

Si 

n — 1 

Reduction due to xs 

[(n — — p*) + 8(xi — pxi)ef 

8? 

after fitting xi 

(n - 1)(1 - p*) 

02 

Residual 

n - 3 S(y - F)* 

si 


Test ^ by the F-test. If F is non-signihcant at some assigned significance 
level we omit the term containing xt and use 

(n — l)(/3i + d*p) + S(a:ic) 

0i =--^- 

n 1 

as the estimate of j9i. If F is significant, we retain the term containing xs, 
and use bi as the estimate of • The estimate obtained by this rule will be 
called b*. 


. 4. Object of this investigation. If we follow such a rule of procedure, what 
will be the bias in 6* as an estimate of /8i ? 


6 . Matiiematical derivation of the bias. First, we wish E{b'i) when 


i<\ or 
8 * 


8 | si '(l-p»)(n- D’ 

where X is the value on Snedecor’s F-distribution corresponding to some assigned 
significance level for 1 and (n — 3) degrees of freedom. From (9) we have 

(11) E{bi) = ft + ftp, 


no matter what the value of A ; since from section 1, and si are independently 

s# 


distributed. 
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Next we wish ^( 6 i) when ^ ^ Xor^ S 7 --—-To obtain this WB 

»« 8* (1 - P*)(n - 1) 

find it more convenient to find first E(bt) when 


where 


6* V. X Vt ^ I 

8| - (1-P*)(n-1)’ 61 -Xcm’ 


V* = «; and C 22 « 


^ ‘ (n-l)(l^p»)- 

The joint distribution of 62 , t's is 

where £ is a constant. We make the transformation of variables 

u — dvt ^ bldu; 

Oi 

then the joint distribution of h and u is 

Taking the expected value when ti < ~ we have 

AC 22 


E{bt) « 


'K-i) 

0 1/^C22 


h 4. " « 




where V 2 = S 2 , and PI 


(" - it) 


-‘>exp[- 


(62 - (n - 3) ,2 


b2u\dudlHy 


is the probability that u be less than or equal 


Dropping subscripts for convenience and expanding the factor which involves 
c to the first power of 6 , we have 


^ «.^l/ 2 c -00 -i/Xr 

m) ^ . . - X / I 






+ ••• 


where 


— «<6<oo 0 <M<;-. 

Xc 




fiiil 

Now, cleiurly the even terms of the aeneO vimish whether n is odd oT^'^^^e^ijli^ 
h is integrated out.' ' 

After integratimt with rrapeet to h, wei have an infuute sMies the 

typical term (apart from cmistants} is of the form 

^ _ 3)^]l<»« 

where r is an even positive integer. Subsequent integration with reepeet toja 
leads to an infinite series of incomplete integrals of the F distributicHa. By 
transforming the integrals, the series may be expressed in terms of incomplete 
beta functions as follows: 


w 




Let 


+ 


, P‘)(« - 

a._ or a-^-' 


Then we have 



and X is the desired % point of the F-distributioa for 1 and (n — 3) degrees of 
freedom. Now from (10) we have 


F(hi) •“ /Si + 0ip ~ pF^) 


which enables us to obtain F(bi} from (12). 


6 . Ibial remit From (10), (11) and (12) we have 

E(b*) - + ft'*) + ^ 

- /Ji + pA - [l ~ P(^ < x)] PF(6,). 
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The biaa in b* is 




Substituting the value of E(bi), we obtain 




where xo = 


, a = (1 




7. Checks. From section 5 we have 


_f-i*_ 




« - 3 3 , 

e% 9 rk *• ^ i 


where Xo = 


n - 3 ^ 

If X = 0, then aJo = 1, and Eibz) = ft . 
Also from section 5 we-have 


Bias = p/32 1^1 - S 


-3 3 , 

F" ’ 2 


If X = 0, then Xo = 1, and Bias = 0. 

If X = <», then aki = 0, and Bias = pfit. 

8. Discussion. From the mathematical form of the bias, 


Bias = p/ 32 j^l - ^ ~ ^ 


n - 3 3 , 
~2~ ’ 2 


where *0 = 


n - 3 

four deductions follow immediately: (i) There is no bias in estimating /3i, if 
p or /32 is zero, (ii) The coefBcient of $2 in the formula is less than or at most 
equal to one in absolute value, (iii) The sign of the bias depends upon the signs 
of p and /32; it is positive if both are positive or both negative, it is negative if p 
and /32 have opposite signs, (iv) The bias is estimatii^ /3i is independent of /3i. 

We shall discuss the bias in a few special cases by means of selected values of 
n, p, fit and X. In Table III exhibited the values of the bias for n equal 
to 5, 11, 21, each at p equal to .2, .4, .6, .8, and fit equal to .1, .4, 1.0, 2.0, 



aod 4A These values have bem computed at the j5% poiut for X; a|id;# jli 
these spedid cases seem to indicate: 0) If we fix p, /%, and X an^ 
thmi the bias decreases, (ii) If we fix p, ^, and n and ehaufe X firoinithe 5% 
pdnt to X a: 1 , theibias decreases considerably, (iii) If we fix p, n, X and increase 

TABLE III " 


The Bias in EsHmatingfii 



X.06 - 18.513 
» — 5 

X.efi 5.318 
n ■» 11 

X.06 1^4.414 
n - 21 

ft \ 

.2 

.4 

.6 

.8 

.2 

.# 

.6 

.8 

.2 

A 

.6 

.8 

0.1 

.017 

.034 

.051 

.069 

.015 

.030 

.046 

.061 

.014 

.029 

.044 

.059 

0.4 

.067 

.134 

.202 

.272 

.049 

.101 

.159 

.227 

.033 

.071 

.122 

.193 

1.0 

.142 

.292 

.455 

.640 

.028 

.072 

.164 

.350 

.001 

.006 

.025 

.132 

2.0 

. 162 

.358 

.627 

1.038 

.000 

.000 

.001 

.083 

.000 

.000 

.000 

.001 

4.0 

.035 

.101 

.282 

.898 

.000 

.000 

.000 

.000 

.000 

.000| 

.000 

.000 



X - 1 



X * 

1 



X - 1 




n 5 



n -• 

11 



n « 21 


P 

.2 

A 

.6 

.8 

.2 

.4 

.6 

.8 

.2 

.4 

.6 

.8 

0.1 

.004 

.008 

.011 

.015 

.004 

.008 

.011 

.015 

.004 

.008 

.011 

.015 

0.4 

.012 

.026 

.040 

.057 

.009 

.019 

.032 

.051 

.005 

.011 

.022 

.041 

1.0 

.011 

.025 

.049 

.095 

.001 

.003 

.010 

.039 

.000 

.000 

.001 

.009 

2.0 

.000 

.002 

.008 

.043 

.000 

.000 

.000 

.000 

.000 

.000 

.000 

.000 

4.0 

.000 

.000 

.000 

.000 

.000 

.000 

.000 

.000 

.000 

.000 

.000 

.000 


ds the bias increases and then decreases. This may be explained in the following 
manner. From section 0, the bias may be written in the form 


< X^ « /n ^ \ 

a Vs / ■^ a e f in — o a . .\ 

5 TT ■ 2+V ■ 


I is relatively 


Now if p, n, X are held constant and 02 is relatively’: small, P 

large and 2 2~ ’ I ^ relatively large, but > X^ is rela¬ 

tively small. Hence, for a while as we increase 02 the bias will increase, but as 0 t 

gets larger p(- < x') and 

\vs / <-o \ 


" , 5 + bectanes smaUer while 


p(^ > X^ becomes larger. Hence, a value of 0 % will be reached at which the 
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bias begins to decrease, (iv) If we fix n, /3j, and X and increase the bias in¬ 
creases without exception. 

The above results were obtained under the assumption that a test of signi¬ 
ficance criterion is used in making a choice as to the number of independent 
variables to be retained after the regression y = hxi -|- has been fitted. 
Tf this test of significance criterion is used, we may wish to have a means of 
controlling the bias. From a‘ study of Table III we note that the bias may be 
decreased by increasing n and by using X » 1. We also notice that as /9t in¬ 
creases from 0.1 to 4.0 the bias increases and then decreases; and so passes 
through a maximum value. Hence, if we have a regression in which $» is fairly 
well below or above this maximum value, we would expect a smaller bias. 

The bias in estimating is “unstudentized,” i.e., is a function of the population 
parameters p and /Sj. In any particular analysis in hand, it would be necessary 
to know the values of p and fit or be willing to use estimates of them obtained 
from the data. 

It is realized that only a beginning has been made on the regression problem: 
an investigation should be undertaken of the more general problem of the use of a 
test of significance criterion in making a choice as to the number of independent 
variables to be retained after the regression 

y = biXi + btXi + - ■ ■ + b„Xn 


has been fitted. 
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THE FROBABIUTir OF COlfViEROfilVCE OF AN ITBiUlTIVE FtlOd^ 

OF mwisaimQ a matrix 

Bt Joseph Ullman 
CQltmbia Univenity 

Introdoctioii. The inversion of a matrix is a computational problem of wide 
application. This is a further study of an efficient iterative method , of mat^ 
inversion described by Harold Hotelling [1], with an examination of the prob¬ 
ability of convergence in relation to the accuracy of the initial approximatioii. 
The lines of investigation were suggested both by his article and by helpful 
comments made during the course of the research. 

The inverse of a matrix can be obtained to any desired degree of accuracy by 
using a variation of the Doolittle method, and starting with a sufficient niunber 
of accurate decimal places in the matrix being inverted. This procedure be¬ 
comes inefficient if the order of the matrix is large, or if the desired degree of 
accuracy is very great. In either case the efficiency can be greatly increased 
by first obtaining an approximation to a small number of decimal plaoes and then 
applying a method of iteration until the desired accuracy is achieved. 

1« Iterative methods* Hotelling’s method of iteration is as follows. Let A 
be the matrix to be inverted and let Co be the approximation to the inverse. 
Calculate in tvm Ci, C 2 , • • • where, 

(1.1) ^ C^{2 ^ AC^). 

This sequence of matrices will converge to the inverse of A if the roots of 

(1.2) A = 1 - ACo, 

are all less than unity in absolute value. 

The iterative method (1.1) will be generalized to yield a class of iterative 
methods, one element of which will be shown to be more efficient, in certain cases, 
than method (1.1). The generalized iterative method is, 

(1.3) C'.+j = C,{1 + (1 - AC^) + (1 - AC^f + ••• + (!- 

For every k, the condition for convergence is that the roots of the matrix (1.2) 
all be less than unity in absolute value. 

A method of comparing the efficiency of these different iterative mef&ods 
arises from the following considerations. Since 

(1.4) C,-A-*(ACo), 
indiich is equivident to 

(IJ) Co » - D), 

205 
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it fdlowB that 

(1.6) A"‘ = Co(l - D)~*. 

When the roots of D are all less than unity in absolute value, (1.6) has the 
infinite expansion, 

(1.7) A-' = Coil + D + D* + D* + ■■■). 

The general iterative process (1.3) generates the infinite series in the following 
manner, 

(1.8) (1 + i) + /)'+••• + + /)' + •••+ 

Each parentheses corresponds to one iteration. Hence k”" terms are generated 
by m iterations. In order to achieve the accuracy of n terms in (1.7), m = 
log* n/loge k iterations are required. Each iteration involves k matrix multi¬ 
plications, so that km — k loge n/loge k is the total number of matrix multiplica¬ 
tions necessary to achieve tliis degree of accuracy. 

The integer for which this is a minimum is three. Therefore the “most effi¬ 
cient’^ method of iteration is, 

(1.10) C„41 = C„{1 + (1 - ACm) + (1 - AC„f\. 

If the desired degree of accuracy can be achieved by one application of (1.1), 
or by two applications of (1.1) but not by one application of (1.10), then (1.1) 
is preferable. 

2. The condition for convergence. The sequence, 

(2.1) (7x, C2, C,, 

obtained from (1.3) will converge to the inverse of A if the roots of 

(2.2) D = 1 - .4Co, 

are all less than unity in absolute value. The following assumptions determine 
the nature of 2). 

We assume that the expected value of each eleriient of the first approximation 
Co is equal to the corresponding element of the exact inverse of A. The actual 
values of the elements of Co will deviate from their expected values. We will 
consider two important cases. If the deviations are entirely due to the fact that 
the elements of Co are only accurate to a limited number of decimal places, say 
ft, then the deviations may be regarded as distributed with constant density over 
a range of length 10""*. It will be assumed that the deviations of the elements 
of Co from their expected values are independent. While this case arises in 
practice, w^e will first treat a closely related case, which lends itself to exact 
treatment more readily. We assume that the deviations of the elements of Co 



■ -C»NVBB0B«rC£^X>F 'AK^13;il«ik1SV£ PBOC^ ': ' ;. 

are tu»maUy distributed abmt their ej^ieeted values^ with variaaoeit » 

The variauee m is the same as tiiat which arim if th» probabUity deu8% is 
form witii range 10~\ : 

The elements of £7, the matrix of deviations, 

(2.3) E - Co, 

are independently and normaUy distributed. Combining (2.2) and (2.3) we 
obtain 

(2.4) D = 1 - ACo = A(A"‘ - Co) * AE. 

Let p be the order of the matrix A. Each element of D will be a linear com¬ 
bination of p independently and normally distributed Variables, and therefore 
will itself be normally distributed. A sufficient condition for all the roots of D 
to be less than unity in absolute value, and hence for the process of iteration to 
converge, is for the sum of the squares of the elements of D to be less than unity 
in absolute value. We will use the following notation 

(2.5) dij : the element of D in the ith row and jth column, 

Nil'Ll: d%- . 

* i 

A procedure Huggested by this relationship is to determine the probability 
distribution of No , so that probability statements concerning the absolute value 
of the roots of D can be made. Because the elements of D are tiot all inde¬ 
pendent, no multiple of N], can be expected to have the X(p*) distribution.* 
nie distribution of N]) is shown to be closely related to the chi-square distribu¬ 
tion in the next section, and on the basis of this I'elationship, lower bounds to 
the probability of convergence of the iterative process are developed in section 4, 
In section 5 the exact distribution of the norm is obtained for a general class of 
cases. The final section is concerned with the validity of applying the results 
of this study to a practical situation, where the deviations of the elements of Co 
from their expected values are uniformly, rather than normally, distMbuted. 

3. An equivalance. Let be the element of E in the ith row and the jth 
column, and a,-; be the element of A in the ith row and the jth column. From 
(2.4) and (2.6), we find that 

dij ~ ^kj • 

* 

Since the elements of E are independently and noi-mally distributed with 
variance m = 10'"^*/12 it follows readily that 

(3.2) E[€ijekh] = 

^ The number in the parentheses will indicate the number of degrees of freedom of the 
chi-square distribution. 
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Making use of (3.1) and (3.2), we find that for two dtj in the sanne ooluma, 
(3.3) E[dijdt^ = n Oucku > 


while for any two dif in different columns, 

(3.4) E[dM = 0. 

From (3.3) and (3.4) it follows that it is permissible to regard the elements of 
the p columns of D as the coordinates of p independently selected points from a 
multivariate normal universe with covariance matrix <r = fiAA'. We will let 
X == <r\ 

Hie moment generating function of the sum of squares of the coordinates of 
any point is 

This can also be written as 


(3.6) 


1 

(r-^5^,0*(l - ••• (1 - arpO* ’ 


where <ri, • • • , jp are the characteristic roots of or. 

Since iV© is the sum of p independent expressions of this type, its moment 
generating function is the pth power of (3.6), 

(3.7) • 


This expression is the moment generating function of 

(3.8) vixLii + v*X(p)j + • • • + vpX(p)p > 

where the xcpx are all independent. 

Writing the roots as 


(3.9) Vo, Vo ~ /ci, • • • , Vo ~ kp—i , 

\riiere vo is the largest root of v, and all ki > 0, it follows that N% has the same 
distribution as 


(3.10) 


VO 2 X(p) 
i 




y-i 


X(p)y 


Therefore, making use of the reproductive power of x*> we obtain 


P{JVo < Ij * P 


(3.11) 


{'•? 


x\p)i < 1 + ^ kjx\p)i 


< 1 + 


/-I ) 
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By malditg special assumptiODifl about tbe daw af^roanuitiaQs^^^^M 
probabUity that ATo will be leas than one, andh^ce that prooe^ of iterate 
will converge, oan be obtained. Instead ctf following this propedute,it is nuiite 
desirable to have definite lower bounds for the probability that Ni> will be less 
than one. This wUl lead to an overstatement in the number of demma^ places 
of accuracy necessary in the first approximation Co to ai»ure {convergence, but 
it will practically eliminate the possibility of having to recalculate the first 
approximation, and hence will lead to greater efficiency in the Irmg run. 

4. The derivation of the formula for detenniniog the required d^ree ol ac¬ 
curacy. The inequality used in this section is derived in two steps from (3.10). 
Since > 0 (i = 1, • • • , p — 1) it follows immediately that 

(4.1) P{No < 1} > P{v«X(p.) <• 1}. 

In order to use this inequality, the upper bound for <ro 

(4.2) (TO < (tr 
can be used. For < = 1, 

(4.3) (To < tr <r = tr fiAA' = M tr AA^ = ijlNa • 

Dr. Wald pointed out that using (4.2) for < = 1 reduces the amount of informa¬ 
tion retained in (4.1) to that which is contained in the inequality, 

(4.4) N(D) < N(A)N{E). 

A closer upper bound is feasible in any particular case, and can be introduced 
at this point by letting ^ = 2 or < = 3. Hie following formula will be developed 
for the general case, making use of (4.3). 

Substituting (4.3) in (4.1), we obtain 

(4.6) < 1\ > P ixU < 

It is desirable to separate the effects of the order of the matrix A on con¬ 
vergence, and the order of magnitude of the elements. Hence we introduce as a 
measure of the average size of the a,y their root mean square m, so that 

(4.6) w* = 2 2 

t jf 

Hence 

(4.7) Na = pm. 

The final form of the inequality is 


(4.8) 
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First we will obtain an expression for the number of decimal places required 
in the first approximation to make the probability of convergence at least .999. 
Then the expression will be checked directly by means of (4.8) and tables of the 
chi-sqiuire fimction. __ 

For large values of p, y/ 2x(pS) is approximately normally distributed with 
mean value \/2p* — 1 and unit variance [2], [5]. Applying this transformation 
to the right hand side of (4.8), and noting that 3.1 standard deviations is slightly 
greater than the deviation corresponding to .999, we obtain as the condition for 

«.9) < '^] > .999 

or 

(4.10) p|2x!,., <2.!^}>.999 

that it is sufficient that 

«•“) 4 /^^^ - > 3.1. 

This is equivalent to 

k > logio p + logic m + logiofV^ - 1 + 

(4.12) \ V2/ 

- Jlogio 24 + logic 

Since the characteristic of a logarithm is insensitive to the argument, rounding 
off will introduce a negligible error, and we finally obtain an upper limit to the 
lower bound of k, 

(4.13) k > logic m. + logic p + logic (p + 3) - .55. 

In order to verify the accuracy of (4.13) for small values of p, certain values of 
p, k and m are chosen and the probabilities associated with (4.8) determined [2]. 
The entries in brackets are the corresponding values of k determined from (4.13). 

A typical example will illustrate the use of table on facing page. Let the matrix 
A to be inverted be a fourth order correlation matrix. The mean magnitude m is 
about i and p = 4. If the first approximation Co is obtained to one place accuracy, 
then the probability that the sequence Ci, C*, • • • will converge to will be 
greater than .999. Using formula (4.13), we obtain k - .53. Since one is the 
first integer greater than .53, the table verifies the use of the formula. 

Although the formula was developed on the assumption that p is large, every 
value calculated is consistent with the table. This lends support to its use for 
small values of p. 
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The ProbabiUtjf ef Cotmrgenee of the lUaro^oe Proeek^ 



\ P 






fc\ 

2 

3, 

4 



-1 

0+ 

0+ 

0+ 

0+ 

m = § 

0 

[.051.982 

[.331.199 

[.5310+ 

[.7010+ 


1 

1- 

1- 

1- 

' 1 _ ■ . 


-1 

0+ 

0+ 

0+ 

0+ 

m » 2 

0 

[.851.051 

[.9310+ 

0+ 

0+ 

. 

1 

1- 

1- 

[1.131.715 

[1.3010- 


2 

1- 

1- 

1- 

1-r 


0 

0+ 

0+ 

0+ 

0+ 

m = 10 

1 

[1.351.439 

[1.6310+ 

1[.8310+ 

0+ 


2 

1- 

1- 

1- 

[2.0011- 


* “1 — ” means greater than .999. 

It has already been pointed out that k is not sensitive to rounding off of the 
argument of the logarithm. Thus for p = 20 and m = 2, we can let logic m = .3, 
logic p = 1.3, logic (p + 3) = 1.36 and obtain 

A; = .3 + 1.3 + 1.36 - .55 = 2.41, 

from which it follows that three decimal place accuracy in Co will practically 
insure convergence of the iterative process. 

6. The mean, variance, and exact distribution. To obtain the moments of 
iVo , the most convenient form to use is (3.8). Since the xLx are independent 

(5.1) .B[A^cI = <r.X(p)<] = p Z) <r.-. 

i t 

<Tsh = E[Nl] - iElNl]f 

= <r<(x(|,).)* + 2 X X(p)ixL)» «r. ‘r;l — (p ]C «■*)* 

(52) * 

= (2p + p*) 52 <r< + 2p* 52 o’iVj — P* 2 — 2p* 52 

i i<i i *</ 

= 2pE<A. 

i 

These can be expressed in terms of the elements of .4 and the variance of the 
elements of E, since 

E « *= tr W) = M tr (AA') » , 

i 

E = tr ff* = M* tr iAA'AA'). 


(5.3) 
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The exact distribation of iVi can be obtained readily vhen p is even, 
case the infinite integral, 


(5.4) 


1 r+* 

2ir (1 - i2<rit)Pl* ... (1 - i2a, <)"* ’ 


Inlius 


can be evaluated by contour integration. The integral satisfies the conditions 
given in Whittaker and Watson [3, sec. 622], if the semicircle of the contour is 
taken on the lower half of the complex plane. 

For the case p = 2, for example, there are simple poles, at < = n~>n~' 

Z<Xi 

The sum of the residues at these poles, multinlied by i 3 delds the exact distribu¬ 
tion: 


(5.5) 


(Ti 6 

2{(ti — oTj ) 


+ 






2(0*8 <ri) 


For even values of p gi*eater than 2, the values of the residues can be obtained 
by repeated differentiation. 


6. Summary. We are now in a position to discuss the applicability of the 
results of this paper to the problem which arises most frequently in practice. 
The elements of the first approximation to the inverse will deviate from their 
expected values only because the first approximation is carried to a limited 
number of places, say h In this case the deviations will be distributed with 
constant density over a range of length 10”"*. The elements of E, the matrix of 
deviations, 

(6.1) E ^ ^ Coy 

are now each independently distributed, but with uniform density, range 10"* 
and mean equal to zero. From (2.4) 

(6.2) D = AE, 

we observe that each element of D will be a linear combination of p independently 
and rectangularly distributed variables, each with mean zero and range 10"*. 
The analysis of sections 3, 4, and 5 will be valid if di, can be considered to be 
normally distributed. 

There is much experimental evidence and theoretical justification for assum¬ 
ing that the elements of D are normally distributed. A sufficient condition that 
the dij approach normality as p increases is that the sum of the a?y in any row of 
A he divergent as the order of the matrix approach infinity, while at the same 
time every element be less than some constant value independent of the order 
of the matrix [4]. 

The experimental and theoretical evidence supporting the approach of the 
dij to normality, the fact that the logarithms are insensitive to errors of approxi- 



. co]mi«»»r(® ;oir'AK itiSiUTiiv® PRot^TO 

■ ' , , '' ■ ' ■ ■' ■' ■■V . ; 

di coQverg&a(» olt the iterative are used, all lend support to tl^ 

k > logio m + logio p + logio (p + 3) - .i5. 

for determining the number of places (h) neoessmy in lire first i^prpximatimi 
(Co) to the inverse of A, a matrix (tf onter p whose elements have meur nse m, 
to malm the probability at least .999 that the process of interation will yield a 
sequence of matrices which will converge to the true inverse. The ultknate 
justification of the use of this fonnula can only be by the results of its aptdicar 
tion in practice. 
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NOTES 

This section is devoted to brief research and expository articles, notes on mrihodohgy 
and other short items. 


ON DISTRIBUTION-FREE TOLERANCE LIMITS IN 
RANDOM SAMPLING 

By Hesbebt Robbins 

* 

Post Graduate School^ Annapolis^ Md. 

Let , • • • , Xn be independent random variables each with the continuous 
and differentiable cumulative distribution function a{x) = Pr(X< < x). A 
continuous function f{xi, • • • ,Xn) with the property that the random variable 
Y = , • • • , Xn)) has a probability distribution which is independent of 

o'(x) will be called a distribution-free upper tolerance limit^ (d. f. u. t. !.)• We 
shall prove 

Theorem 1. A necessary and sufficient condition that the continuous function 
f(xi, • • • , Xn) be a d. f, u, t. L is that the function 

n 

/(a:, , • • • , x„) = n {fixi , • • • , a!„) - *<} 

be identically zero. 

Proof, Since / is continuous, we can prove the necessity of the condition by 
deriving a contradiction from the assumption that / is a d. f. u. t. 1. for which 
• there exist distinct numbers ai, • • • , Un such that f{ai , • • • , On) = A 9 ^ ai, 
(i = 1 , ■■■ , n). 

Since the numbers ai, • • • , a,*, A are distinct, there will exist a positive 
number c such that the (n +1) intervals 

J; A — €]^x^A”i“€ 

a* — € < X < a,- + 6 (i = 1, • • • , n), 

have no points in common. Moreover, since / is continuous, there will cor¬ 
respond to € a positive number €1 < e such that 

A - t< f(xi , • • • , Xn) < A + €, 
provided that simultaneously 

I Xi ~ a< I < €1 (i = 1, • • • , n). 

Now let p be any number between | and f. Corresponding to p we define 
the function <rp(x) as follows. In the interval I we set <rp(x) = p. In every 
interval 

Ji: Ut €1 ^ ^ ^ ~ *' * > 

' Cf. S. S. Wilks, Mathematical Statistics, Princeton University Press (1943), pp. 93-94. 
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we let ffpix) increase an Outc^e the intervals j, ji, • • 

we define <rp(.x) in any manner so ^t it is cont^ous, differential^ aii^ lioa^ 
decreadng for every *, and has to properties ff,(— «) * 0, (r,(oe) = 1.. Jt is 
clear that we can do tins. 

Let S denote tiie set of all points («i, - - - , Xn) of n dimensioDai sndk 
that simultaneously 

1 Xi — 1 < «i (» »» 1, • • •, n). 

Then by cmistruction, for a, (x) defined above, 


PrdXi, ••• ,X.)«S) 


(a' 


But if (Xi, ••• , Xn) e S, then by construction, 

A -e</(Xi,--.,X.)<A + « 

and 


F = ap(/(Xi,...,Z,)) = P. 

Hence for a(x) — <rp(x) we have 

Prir - rt > (ly. 

But since / is a d. f. u. 1.1., this inequality must hold for any a (x). 
Now choose a set of numbers 


J < Pi < P2 < • • • < Pm < f, 
where m = 2(3n)’'. Then from the above, 

Pr(V =5 one of the numbers Pi, • • • , Pm) > 2. 

This is the desired contradiction. 

Let Or(xj, • • • , a^n) be the function whose value is the rth term when the num* 
bei-s a:i, • • , Xn are arranged in non-decreasing order of magnitude. In tarns 
of the functions Or we can characterize the continuous functions / which satisfy 
the identity / = 0 as follows. I^et , 4 be a permutation of the integers 

1, • • • , n. Denote by E(ii, • * • ,in) the set of all points (xi, • • • , Xn) such that 

X<i < Xij < • • • < Xi^ . 

The n! sets E are open and disjoint. Since / is continuous and / « 0, in each 
E(ii , * * * , in) we must have, for some r, 

f{Xi , • • • , Xn) ^ Or(Xi , * • • , Xn)i 

where the integer r = r(ii , • ,in) must depend on the permutation ii, • • • , in 

in such a way that / may be extended continuously over the whole space. (The 
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condition fw this is as follows. Two permutations »i, • * • , wd ji, , j» 
may be called adjacent if they differ only by an interchange of two adjacent 
integers. Then for any two adjacent permutations, either r(ti, • • - , t*) “ 
*■0*1» • • • »i*) or the two values of r are the two interchanged integers. For 
example, tire fimction 


/(xi ,Xt,Xt) 


0»(xi ,Xt,Xi) if Oz{xi , xj , xa) =* Xi 
P 2 (xi, X!, X|) otherwise 


satisfies this requirement.) 

We shall now prove that the necessary condition, / ^ 0, of Theorem 1 is suf¬ 
ficient to ensure that the continuous function / be a d. f. u. t. 1. From the 
argument of the preceding paragraph, any continumis function f such that 
/ ■ 0 will in each set E{ii , • • • , in) have the value 0,(xi, • • • , Xn), where r is 
an integer from 1 to n. Since the variables Xi, • • • , Xn are independent and 
have the same probability distribution, the probability that (Xi, **■ , X«) will 
belong to E(ii , • • • , in) is equal to (1/n!) for every permutation ii , • • • , in . 
Let 


W=/(X,,...,Xn). 

Then if <pix) =» daix)/dx denotes the probability density function of each X,-, 
the conditional p. d. f. of W * Or(Xj, • • • , Xn), given that (X,, • • • , X,) 
belmigs to X(ij, • • • , in), will be nl^,(«>), where 

,1 f,„\ - - (r(w)]""’' 

- (r - 1)1 (n - r)l ' 

Thus ^r(w) will be of the form 

^r(w) * <f>{w)FrW{w)), 

where Fr(o’(w)) is a polynomial in v(w). Hence the conditional p. d. f. of F <= 
ffClF), given that (Xi, • • • , Xn) belongs to E(ii , • • • , in), will be n!$,(y), where 

ir(y) «= Fr(y), 

and the p. d. f. of Y will be 

f(v) - 2F,(»), 

where the summation is over the nl integers r = r(ii, • • • , in). This is inde¬ 
pendent of v(x), so that / is a d. f. u. 1.1. This completes the proof of Theorem 1. 

A function f{xi , • • • , Xn) is symmetric if its value is unchanged by any permu¬ 
tation of its arguments. It is clemr that the only continuous and symmetric 
functions / which satisfy the identity / « 0 are the n functions Or(xi, • • • , x*). 
Hence we can state 

Thbobbu 2. The only symmetric d. f. u. t. Vs are the n functions Or(xi, ' ■ * , Xn) 
(r - 1, • • •, n). 



A FOItMOLA VOBt mimM Sim ^mAXlOll 7 
TOISRASCB miXTS 

By H. ScBsrF^ and J. W. Tduiy 

Princeton Utmeraity 

Iq a paper to appear in a later issue of this journal doling witii various reeults 
on non-paramelaic estimation, we shall discuss in detail an apprmdmste foraUila 
for the numerical calculation of sample sizes for Wilks’ population toloKinee 
limits. Because of tiie practical usefulness of this formula, it seems desinl^ 
to make it available without delay. Its accuracy is adequate for all direct ap¬ 
plications. 

An interval I is ^d to cover a proportion ir (tf a univariate population wit& 
cumulative distribution function F(x) if JtdF ^ ir. Let Xi, Xj, •••,X* 
be a random sample from the population, and Zi < Z* < • ■ • < Z« be a reaiv 
rangement of Xi, Xj, • • • , X« . Define Zo == — Z«+i = •+•(», and consider 
the proportion B of the population covered by the random interval (Zj,, Zn-mM)- 
Then Pr{B > &} is indep^dent of F{x) if Fix) is continuous^ and eqUi^ 
1 — 7»(n — r -H 1, r), where r ^ k tn and 7,(p, q) is K. Pearson’s notation 
for the incomplete Beta function. 

Choose a confidence coefficient 1 — a, a pair of positive integers k, m, and a 
fraction b. The sample size n for which we can make the statement “the prob¬ 
ability is 1 — a that the random interval (Zt , Zn-wt+i) cover a proporticm b 
or more of the population” is then determined by the equation 

(1.1) hin — r + 1, r) = a, 
where r = k + m. Our approximate solution is 

(1.2) n \xlil + b)/il -b) + Ur - 1), 

where Xa is the lOOa percent point on the x -distribution with 2r degrees of 
freedom. The required values of x« Daay be found for a — .1, .06, .026, .01, 
.005 in Catherine Thompson's table [2], For this range of a, and for b > .9, 
extensive numerical calculations indicate that the error of (1.2) is less than one 
tenth of one percent, and is always positive, that is, n is slightly overestimated 
by (1.2). We have not yet obtained an analytic proof of this statement, which 
refers to the difference from the exact (and, in general, non-integral) solution of 
(1.1). 

As explained elsewhere [1], formula (1.2) may be used for Wald's solution 
of the multivariate case. 
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A GENERALIZATION OF WARING’S FORMULA 


By T. N. E. Greville 
Bureau of the Census 


Waring^s formula (frequently, but less correctly, called Lagrange’s formula) 
gives the polynomial of degree n taking on specified values for n + 1 distinct 
arguments. It is frequently used for interpolation purposes in dealing with 
functions for which numerical values are given at unequal intervals. This 
formula may be written in the form: 


( 1 ) 


/(^) = 



n ^ 



where Oo y ai ^ , • • • , an are the arguments for which the value of the poly¬ 

nomial f(x) is given. This formula was first published by Waring [2] in 1779, 
and it was not until 1795 that Lagrange gave it in his book: Logons Elimeniaires 
sur les Mathimatiques, The prominent British actuary and mathematician, 
Mr. D. C, Fraser states that there are identities of notation in the statement of 
the formula which leave little doubt that Lagrange was simply quoting from 
Waring’s paper.” Waring’s priority was brought to my attention by Mr. Fraser 
and by Dr. W. Edwards Deming. 

If any two or more of the arguments a» arc equal, the form (1) becomes inde¬ 
terminate. However, the limiting value, as m -f 1 specified arguments approach 
a common value a, can be shown to be an expression involving the first m deriva¬ 
tives of the polynomial f{x) for the argument a. This case of “repeated argu¬ 
ments” is of considerable interest, especially in connection with the theory of 
osculatory, or smooth-junction interpolation [1, p. 33]. It is the purpose of this 
note to generalize the formula (1) to the case in which not only the value of 
f(x) but also of its first m,- derivatives are given for each argument a<. The 

n 

degree of the polynomial represented, which we shall denote by A^, is n + ]^ m, . 

»«0 

The generalized formula is: 


( 2 ) 


m = z p.(x - <u) n 1 , 

t«»o L \di dj/ j 


where P,(x — a,) denotes a polynomial in a: — obtained by the following pro¬ 
cedure. First, f(x) is expanded in a Taylor series in powers of x — a*. Next, 

the expression I 1 H-—— ] is expanded as a binomial series for every j 

\ ai Oj/ 

different from i. Finally, all the n + 1 expansions (n binomial and one Taylor) 
are multiplied together, and all terms containing powers of x — Ot higher than 
are rejected. This formula has already been given by Steffensen [1, p. 33] 
for the particular case in which every = 1. 

The general formula (2) is difficult to arrive at without a previous knowledge 



. ' vjmuMCB'Aim'BSOT''SiSFik^ ^ ^ 

of the result, but is easily slrawii to be Ibe correct exprasedcm. Upcnii diSefea^t- 
ing k times (0 g A; ^ mr) all the terms m the summatioQ exc^ tibe ooc 
responding to i » r will contain the factor {x — Or)"’’”*'*’*, and ^ 
vanish for x => Or. Moreover, the non-vanishing term, before dMerentiationv 
will agree, up to and including terms containing (* — a,)"', with the Taylor 
expansion of /(«) in powers of x — o,, since the product exinession wilhin the 
brackets will be exactly canceled, as far as terms of degree Mr , by the n binomial 
expansions. Hence the kth derivative of the non-vanishing term in the summa¬ 
tion will be/*’(o,) for x = a,. This establishes the formula. 

This formula is clearly equivalent to the Newton divided difference intei^ola- 
tion formula with repeated arguments [1, p. 33], the argument o< occurring 
Wf -f- 1 times. Therefore, if /(x) is any function other than a pol 3 momial of 
degree N or less, it is necessary to add a remainder term [1, pp. 22-23] of the form 

Mx) n (X - air*\ 

twO 

where f^ix) denotes the limiting value [1, pp. 20-21] of the divided difference of 
order N involving the arguments a;, oo, ai, • • • , On, with each argument a< 
appearing m, + 1 times. The existence of all the indicated derivatives is, of 
course, essential. 
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NOTE ON THE VARIANCE AND BEST ESTIMATES 

By H. G. Landau 
Washington, D. C. 

The purpose of this note is to point out a certain relation between the vari¬ 
ances, < 7 ? and <72, of the random variables, xi and X 2 , and the probabilities, 

Pi(0 = Prl\xi - E{xx)\ < t] 

P^{t) = Pr[|x, - E(xt)\ <•<]. 

This is, ii<r\ < <r\, their Pi(f) > P*(0 in at least one interval, h <t <tf 
A note by A. T. Craig [1] gave an example for which it was stated that al < v| 
and Pi(t) < Pj(t) for every t; but, as was pointed by Neyman [2], calculation of 
the probabilities involved shows the statement to be incorrect. 

The present result provides a certain justification for the use of minimum 
variance estunates by assuring that no other estimate with the same mean can 
have, for every value of t, a greater probability of a deviation frcan the mean 
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lees than t. If an estimate can be foimd which has a greater value of P(f) for 
all t than does any other estimate, it is necessarily the minimum variance 
estimate. 

The theorem below includes a similar relation for equal variances. This 
theorem can be obtained from known general results on inequalities for distri¬ 
butions determined by moments, [3] and [4]. The formulation given here witii 
its significance for esthnates does not appear to have been remarked. 

Thkoreh. If the random variables, xi and xt , have finite variances, ol and 
, and 

2^2 
Vl S , 

then, either 

Q{t) = Piit) ^ ftCO, 

is equal to zero al all points of continuity^ which can occur only for a\ ^ <t\ y or there 
is an interval, ti < t < k , in which Q{t) is positive. 

Proof. We write the variance as the Stieltjes integral, 


A = f°edPi{t), 

and similarly for <tI . 

Jjet 


S{T) = f tUPiit) - f edPiit) = f fdQit) 
Jq Jo Jo 

= T^Q{T) -2 r tQ(t) dt, 

Jo 


integrating by parts. 

Now 

r*(i - Pi(r)] = r* f" dP,«) < f* <*dP.(<), 

J jT J'r 


and since ol is finite. 



dPiit) -*0 as T 


oo, 80 that Urn r*[l - Pi(r)] 


= 0, and similarly for ^ 2 ( 0 - 

Hence T^Q(T) = T^[l - P 2 (T)] - T^l - Pi{T)]-^{) as T oo, and since 
by definition lim S{T) v <ri — <jr 2 it follows that 



From this it can be seen that either, Qit) vanishes at all points of continuity, 
in which case al — al, or Q(t) must be positive in some interval, since other¬ 
wise / tQ{t) dt must be negative and hence ai — 0*2 > 0 contrary to the assump- 
Jo 

tion, 0*1 < flTj, 



' yiuuAKqs''AN]».Bis(tr.:'i^^ ' /^ ^ 
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NEWS AND NOTICES 

ReaderB are invited to submit to the Secretary of the Institute news items of general interest 

Personal Items 

Dr. R. L. Anderson, on leave from the North Carolina State College, is 
serving as a research mathematician on a war research project at Princeton 
University. 

Professor Kenneth J. Arnold, on leave from the University of New Hampshire, 
is doing war research in the Statistical Research Group, at Columbia University. 

Dr. W. J. Dixon, on leave from the University of Oklahoma, is serving as a 
research mathematician on a war research project at Princeton University. 

Mr. R. M. Foster of Bell Telephone Laboratories has been appointed professor 
and head of the department of mathematics of the Polytechnic Institute of 
Brooklyn. 

Dr. Hilda Geiringer, Lecturer at Bryn Mawr College, has been appointed 
Professor of Mathematics and Head of the Mathematics Department at Wheaton 
College, Norton, Massachusetts. . 

Assistant Professor E. H. C. Hildebrandt of the State Teachers C\)llege, 
Upper Montclair, New Jei*sey, has been appointed to an assistant professorship 
at Northwestern Univei’sity. 

Mr. John F, Kenney of the Unix^ersity of Wisconsin has been promoted to an 
assistant professoi’ship. 

Assistant Professor L. A. Knowler of the TTnivei*sity of loxxa has })een promoted 
to an associate professoi’ship. 

Dr. Saul B. Sells is now Head Statistician of the Statisti(;al Standards Branch, 
Office of Price Administration, Washington. 

Mr. Walter H. Thompson, formerly an instructor at Virginia Polytechnic 
Institute, is now associated xvith the Kellex Corporation, New York. 

Professor Helen M. Walker of Teachers C'ollege, (\)lumbia University, hiis 
been elected president of the American Statistical Association. 

Professor C. C. Wagner of Pennsylvania State College has been appointed 
assistant dean of the School of Liberal Arts. 

Dr. Edward Helly of the Illinois lastitute of Technology died November 28, 
1943. 


New Members 

The following persons have been elected to membership in the Institute: 

Andrews, T. Gaylord. Ph.D. (Nebraska) Instructor in Psychology, Barnard College, 
Columbia University, New York City. 

Angell, Dorothy T. Member of Technical Staff, Bell Telephone Laboratories. Murray Hill, 
New Jersey. 

Barnes, John L. Member of Technical Staff, Bell Telephone Laboratories. 21 North 
Cherry Lane, Rumson, New Jersey. 
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Bloom* Bose, B.A. (Hunter) IVt., Billings (^toeral Hospital, Ft. Benj. iiatiisoii^ii»Biiia 

Bonnary Robert Underwood. M.S. (tJniv. of Washington) Associate Chemist, Bureau ^ 
Ships, Navy Dept. 4^4 Whit^Ume Hood, SUverspHnff, Maryland. 

Brearty, C. R. B.S. (California) Officer in Charge, Quality Control Section^ Signal Ooips 
Inspection Agency. 19 West St., Dayton Ohio. 

Campbell, George Clyde. M.S. (Iowa) Captain CWS Chief, Fiscal Dhr., pine Bluff 
Arsenal. Troy Road^ RFD Boonton, New Jersey. 

Clifford, Paul C. A.M. (Columbia) Asst. Prof, of Math., State Teachers (Allege, Mont¬ 
clair, New Jersey. 641 Upper Mountain Ave. 

Cobb, William J. Statistician, Census Bureau. iOS6 8th St.f NEy Washingtonf D. C. 

Coggins, Paul Pond. M.A. (Harvard) Accountant, American Tel. and Tel. 196 Broad¬ 
way, New York, N. Y. 

Dietzold, RobertL. Ph.B. (Yale) M.T.S., Bell Telephone Lab. S4 W. llthSt., New York, 
N. Y. 

Elconin, Victor. M.S. (California Inst, of Technology) Associate Physicist, California 
Inst, of Technology. 740 Cordova Ave., Glendale 6, California. 

Ferrell, Enoch B. M.A. (Oklahoma) Member of Technical Staff, Bell Telephone Labora¬ 
tories. 76 Fuller Ave., Chatham, New Jersey. 

Ferris, Charles Duncan. A.B. (Princeton) Engineering Statistician, Surveillance Branch, 
T/3 Army. Ballistic Research Laboratory, Aberdeen Proving Ground, Md. 

Goldberg, Henry. M.A. (Ck)lumbia) Asst. Mathematical Statistician, Columbia Uni¬ 
versity, 401 West 118th Street, New York 97, N. Y. 

Gordon, Donald A. A.M. (Columbia) Assistant, Columbia University. 1597 E. 96 
Street, Brooklyn, New York. 

Greenleaf, Herrick E. H. Ph.D. (Indiana) Prof, of Mathematics. 1094 8. College Avenue, 
Greencastle, Indiana. 

Griffltts, C. H. Ph.D. (Michigan) Professor of Psychology. 1607 Charlton Ave., Ann 
Arbor, Michigan. 

Hadley, Clausin D. Ph.D. (Wisconsin) Statistician, Marketing Research Dept., Eli Lilly 
and Company, Indianapolis 6, Indiana. 

Halbert, K. W. A.M. (Harvard) Statistician, Amer. Tel. and Tel. Co., 196 Broadway, 
New York, N. Y. 

Hall, Marguerite F. Ph.D. (Michigan) Asst. Prof. Public Health Statistics. 96 Ridge¬ 
way, Ann Arbor, Michigan. « 

Halmos, Paul Richard. Ph.D. (Illinois) Asst. Professor of Mathematics, Syracuse Uni¬ 
versity. 615 Fellows Avenue, Syracuse 10, N. Y. 

Harold, Miriam S. B.A. (Hunter) Member of Technical Staff, Bell Telephone Labora¬ 
tories. 19 Hillside Avenue, Chatham, New Jersey. 

Hatke, Sister M. Agnes. M.S. (Indiana State Teachers College) Graduate student at 
Purdue University. St. Francis College, Lafayette, Indiana. 

Hizon, Manuel O. M.A. (Michigan) Philippine Govt. Scholar at Univ. of Michigan. 
816 Packard, Ann Arbor, Michigan. 

Hodgkinson, William, Jr. B.A. (Harvard) Major, HQ AAF, U. S. Army. 196 Broadway, 
New York, N. Y. 

Jacob, Walter C. Ph.D. (Cornell) Lt., U. S. Navy. Bureau of Ships, Navy Dept., 
Washington, D. C. 

Jones, Howard L. C.P.A. (Illinois) Supervisor of Revenue Results, Illinois Bell Tele¬ 
phone Co. Room 1100, 509 W. Washington St., Chicago 6, III. 

Kaltz, Hjrman B. A.B. (George Washington Univ.) Statistical Consultant, Psychological 
Research Unit No. 11, B.A.A.F., Fort Myers, Fla. 

Ke 3 rfitz, Nathan. B.Sc. (McGill) Statistician, Dominion Bureau of Statistics, Ottawa, 
Canada. Billings Bridge, Ontario. 

Kirchen, Calvin J. M.A. (Wisconsin) Product Control Statistician, U. S. Rubber Co. 
4098 11th St. Place, Des Moines 15, Iowa. 
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LaSalt, Lucy Anne. B.A. (Hunter) Applied Math Group, Columbia Univ. Irpinff 
Avenue, Brooklyn $7, New York, 

Leone, Fred Charles. M.S. (Georgetown Univ.) Instructor in Math., Purdue Univ. 
SIO N, Saliebury St., W, Lafayette, Indiana. 

McNamara, Kathryn J. M.A. (Clark Univ.) Economist, Business Research Dept., H. J. 
Heinz Company, P.O. Box 67, Pittsburgh 30, Pa. 

McPherson, John Cloud. B.S. (Princeton) Director of Engineering, InUl Business 
Machines Corp. 690 Madieon Ave., New York 99, N. Y. 

Millikan, Max F. Ph.D. (Yale) Asst. Director, Div. of Ship Requits., War Shipping 
Admin. 9S1S Huidekoper PI., NW, Washington 7, D. C. 

Morrow, Dorothy Jeanne. M.S. (Univ. of Washington) Fellow in Mathematical Statistics, 
Columbia University. 606 W. 116 St., New York 96, N. Y. 

Morse, John W. M.A. (Ck>lumbia) Ordnance Engineer, Quality Control Unit Inspection 
Sec., Ammunition Branch, War Dept. 11 Verne St., Bethesda, Md. 

Mottley, Charles McCammon. Ph.D. (Toronto) Lieut., USNR, Bureau of Ships, (333), 
Navy Dept., Washington, D. C. 

Murphy, Ray B. B.A. (Princeton) 2nd Lt., U.S. Marine Corps Reserve. 96 Godfrey Bd., 
Upper Montclar, N. J. 

MysUvec, Vaclav. Ph.D. (Prague) Czechoslovak delegate to the United Nations Interim 
Commission on Food and Agriculture. Room 606,1776 Broadway, New York 19, N. Y. 

Nicholson, George Edward, Jr. M.A. (Univ. of North Carolina) Asst. Mathematician, 
Applied Mathematics Group, Columbia Univ. 176 Park St., Montclair, N. J. 

Osterman, Herbert William. B.S. (Michigan) c/o B. E. Wyatt, 1099 Vermond Ave. NW, 
Washington, D. C. 

Parke, Nathan Grier, III. A.B. (Princeton) Senior Aviation Design Research Engr., 
Navy Dept. Malvern Ave., Ruxton 4, Md. 

Priestley, Alice B. A. M.A. (New York Univ.) Instructor in Math, Lafayette College. 
996 McCartney St., Easton, Pa. 

Rapkin, Chester. M.A. (American University) Associate Statistical Analyst, Deputy 
Director, Division of Operating Statistics, Federal Home Loan Bank Admin., 9 Park 
Avenue, New York 16, N. Y. 

Rivoli, Bianca. M.A. (Columbia University) Statistician, Judson Health Center. 3966 
Bainbridge Avenue, New York 67, N. Y. 

Roshal, Sol M. B.S. (Chicago) OfEcer in charge of research statistics, PRU Nashville 
Army Air Center, Nashville, Tenn. 

Ross, Frank A. Ph.D. (Columbia) Editor, Journal of the American Statistical Assn. 
Thetford, Vermont. 

Schaeffer, Esther. A.B. (Chicago) Technical Asst., Univ. of Michigan. 647 Elm Street, 
Ann Arbor, Mich. 

Schilling, Walter. M.D. (Harvard) Asst. Clinical Professor of Medicine, Stanford Uni¬ 
versity Hospital, San Francisco 15, California. 

Shannon, Claude E. Ph.D. (M.I.T.) Member of Technical Staff, Bell Telephone Labora¬ 
tories, 403 West Street, New York, N. Y. 

Sherman, Jack. Ph.D. (Calif. Inst, of Technology) Research Chemist, The Texas Co. 
170 Church St., Poughkeepsie, New York. 

Smallwood, Hugh M. Ph.D. (Johns Hopkins) Asst. Dept. Head, Physical Research 
Dept., U. S. Rubber Co., Mhrket A South Streets, Passaic, N. J. 

Smith, Joan Thiede. B.S. (Minnesota) Accountant. 678 East Nebraska Ave., St. Paul, 
Minn. 

Smith, R. Tynes, m. Chief, Transport Economics Branch, Traffic Control Div., Office 
Chief of Transportation, A. S. F. 100116th St. S., Arlington, Va. 

Sobczyk, Andrew. Ph.D. (Princeton) Staff Member, Radiation Lab., Mass. Inst, of Tech¬ 
nology. 39 Bow Street, Lexington 79, Mass. 
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TlHKttoae,IottiitLMii. Ph.D. (CStict^) Profeasor6!I^yeMagy.Univemtjrof Chfeiif»« 
C^^cl^p>, niinoia. 

Tonlball«, Iioopddo V. Ph.D. (Michigan) Special Inatruetor, University of IjiCiehigiHi. 
llWWittard8t.fAfmArhor,Mich, 

Waldi( Joka X. B.8. (Notre IHune) Matiiemstician, Lockheeiil Air^t Carp. 7P7 Etui 
SXk Ave., Glendale S, Calif. 

Weber, Bmee Travis. M.A. (Columbia) Member of Techmcal Staff, Bell Teieplxnie 
Laboratories, Marray Hill, New Jersey. 

Weiner, Lonls. AM. (Harvard) Economist, U. 8. Bureau of Labor Statistics. 
RueeeU Ave., ML Rainier, Maryland. 

Woodward, Patiida. Ph.D. (Pennsylvania) Associate Executive Secretaiy, Committee 
on Food Habits, National Research Council. tlOl Conetitution Ave., Waehington SS, 
D.C. 



REPORT ON THE WASHINGTON MEETING OF THE INSTITDTB 

The second regional spring meeting of the Institute of Mathematical Statistics 
was held at George Washington University, Washington, D. C., Saturday and 
Sunday, May 6 and 7, 1944, jointly with a regional meeting of the American 
Statistical Association. The 383 registrants at the joint meeting included the 
following 76 members of the Institute: 

Paul H. Amlerson, U. L. Anderson, Theodore W. Anderson, Jr., Kenneth J. Arnold, 
Kenneth J. Arrow, 1st Lt., AC, Blair M. Bennett, Ernest S. Blanche, C. I. Bliss, Bonnar 
Brown, Joseph G. Bryan, Marjorie F. Buck, A. George Carlton, C. W. Churchman, William 
J. Cobb, Major A. C. Cohen, Jr., Edwin L. Crosby, Haskell B. Curry, J. H. Curtiss, B. B. 
Day, Scott Dayton, W. Edwards Deming, Philip Desind, W. J. Dixon, J. L. Doob, Will 
Feller, William C'. Flaherty, Thomas N. E. Greville, E. J. Gumbel, Trygve Haavelmo, 
Margaret Jarman Hagood, Morris H. Hansen, Elvin A. Hoy, Leonid Hurwicz, Lt. (jg) W. C. 
Jacob, Alice S. Kaitz, Evelyn M. Kennedy, Lila F. Knudsen, H. S. Konijn, T. Koopmans, 
Anita R. Kury, Jacob E. Lieberman, Philip J. McCarthy, Francis McIntyre, William G. 
Madow, Lt. C. J. Maloney, Sophie Marcuse, John W. Mauchly, A. M. Mood, Vladimir A. 
Nekrassoff, Monroe L. Nordeii, H. W. Norton, Victor Perlo, A. C. Rosander, William Salkind, 
Marion M. Saiidomire, Mtix Sasuly, Franklin E. Satterthwaite, S. B. Sells, L. W. Shaw, 
W. Arthur Shelton, Walter A. Shewhart, Harry Shulman, Blanche Skalak, John H. Smith, 
R. T. Smith, Arthur Stein, Joseph Steinberg, J. W. Tukey, Joseph L. Ullman, David F. 
Votaw, Jr., Capt, A. N. Watson, Frank M. Weida, Ijouis Weiner, S. S. Wilks, Patricia Wood¬ 
ward, Bertram Yood. 

All sessions were held jointly with the American Statistical Association. 

Professor Will Feller of Brown University acted as Chairman for the Saturday 
afternoon session. The following papei's were presented: 

1. Elements of the Theory of Testing Hypotheses. 

J. H. Curtiss, Jr,, Navy Department 

2. Large Sample Tests of Statistical Hypotheses. 

Abraham Wald, Columbifi University 

Professor Frank Weida of George Washington Univemity acted as Chairman 
for the Sunday morning session. The following contributed papers were 
presented: 

1. On the Statistics of Sensitivity Data. 

C. West Churchman and Benjamin Epstein, Frankford Arsenal and the University of 

Pennsylvania * 

2. Simplified Plotting of Statistical Observations. 

E. J. Gumbel, New School of Social Research 

3. Distribution of Sample Variances and Covariances of Normally Distributed Noncentral 

Variables. 

M. A. Girshick, Department of Agriculture 

4. An Application of the Variate Difference Method to Multiple Regression. 

Gerhard Tintner, Department of Agriculture 

5. Autocorrelation in London Temperature. 

Horace W. Norton, Department of Commerce 

m 
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Professor Samuel S. Wilks of Priiicettm University acted as Chainnan f(^ iNi 
Sunday afternoon session. The following papers were presented: 

L RegrcBsian Problem in Time Series. 

Tjalling Koopmans, Combined Shipping Adjustment Board 

2. Foundations of the Theory of Time Series, 

J. L. Doob, Navy Department 

The business meeting of the Washington Chapter of the Institute was held 
Sunday morning. The proposed Constitution of the Washington Chapter was 
ratified, and elections under this constitution were held for the first time. The 
following officers were elected: 

William G. Madow, Census Bureau—3 year term, Secretary of the Program Committee 
1944-45 

Solomon Kullback, War Department—2 year term 

Frank Weida, George Washington University—1 year term. 


W.G. Madow 
Secretary, Program Committee 
Washington Chapter 




THE ELEMENTARY GAUSSIAN PROCESSES 

By J. L. Doob 
University of Illinois 

1. Introduction 

One of the simplest interesting classes of temporall}'^ homogeneous stochastic 
processes is that for which the distributions of the defining chance variaWes 
{x(t)] are Gaussian. It is supposed that 

(A) if /i < • • • < tp, the multivariate distribution of x(<i), • • • , x{tp) is 
Gaussian/ and that 

(B) this distribution is unchanged by translations of the ^-axis. 

The process is iV-dimensional if x{t) is an iV-tuple Xi{t)f •• • • , Xjfit). The means 
E{x{t)]^ are independent of t, and will always be supposed to vanish in the 
following discussion. 

The correlation matrix function R{t): (r.i(O) is defined by 

(1.1.1) rail) = E[xi{s)xiis + t)\. 

This expectation is independent of s, l^ecause of condition (B). The matrix 
function R{t) satisfies the equation 

(1.1.2) ^ijiO ~ i, ^ 1, • • • , AT. 

It follows that when / = 0 the matrix is symmetric: 

(1.1.3) ^ ro(0) = ry.-(0), i,; = 1, • • • , N, 

and it is also well known that R(0) is non-negative definite. Conditions on the 
functions riXO necessary and sufficient that R(f) be the correlation matrix 
function of a stochastic process were found for the case AT = 1 by Kbintchine* 
and for all JV by Cram6r.^ 

Hypothesis (A), that the process is Gaussian seems at first a restriction so 
strong that Gaussian processes are unimportant. These processes are, however, 
of fundamental importance, for the following reasons. 

(i) If R(t) is the correlation matrix function of an^ temporally homogeneous 
stochastic process, there is, according to Khintchine and Cramer, a Gaussian 
process with this same correlation function. This Gaussian process is uniquely 
determined by the correlation function (assuming that all first order moments 
vanish, as usual). Because of this intimate connection between the temporally 
homogeneous Gaussian processes and the most general temporally homogeneous 

^ Singular Gaussian distributions will not be excluded. For example the x(t,‘) may all 
vanish identically. 

* The expectation of a chance variable x will be denoted by B{x}, 

^ Matematische Annalen^ Vol. 109 (1934), p. 608. 

^ Annoys of Math,, Vol. 41 (1940), pp. 21&-230. 
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processes, it is not surprising that very few facts are known about specifically 
Gaussian processes, that is facts which are true of temporally homogeneous 
Gaussian processes, but not of temporally homogeneous processes in general. 

(ii) It follows from (i) that in any investigation of temporally homogeneous 
stochastic processes involving only first and second mioments—^for example 
least squares prediction by linear extrapolation—^it may be assumed that the 
variables are Gaussian. Under this hypothesis, the investigator may be helped 
by the suggestive specialized interpretations possible in the Gaussian case of 
results which hold in the general case. For example if == 1, the least squares 
best prediction in the Gaussian case for x(n + 1) in terms of a linear combination 
of the variables a;(l), • • • , x{n) is the conditional expectation of x{n + 1) for 
given x(l), • • • , x(n), which is the least squares best prediction of x{n + 1) 
in terms of a:(l), • • • , x{n) with no restriction on the functions involved. Thus 
the linearity of the prediction, which must be part of the hypothesis in the gene¬ 
ral case, is automatically true in the Gaussian case. There is necessarily a linear 
least squares best prediction of x(n -f- 1) in terms of the complete past 
• • • , x{n — 1), x{n) since the corresponding conditional expectation is certainly 
defined in the Gaussian case, and is linear in that case. 

(iii) In many applications, there is a real justification for hypothesis (A) that 
the process is Gaussian. This is so in certain physical studies, for example, 
because the Maxwell distribution of molecular velocities is Gaussian. Examples 
will be given below. 

The processes discussed in the present paper are all tempoi*ally homogeneous 
Gaussian processes. Most of the theorems will be valid for any temporally 
homogeneous processes for wliich the second moments of the variables exist,^ 
with the following changes: independent chance varial)les which^re linear com¬ 
binations of the x{s) will become merely uncorrelated chance variables; the 
convergence with probability 1 of a series of such chance variables will become 
merely convergence in the mean; the conditional expectation of one such variable 
y for given values of others, yi,y 2 j • • • will become merely the linear approxima¬ 
tion 2 y terms of the y, which minimizes 

i 

i 

that is to say the conditional expectation becomes the least squares lineai' 
approximatibn. 

The following theorem and its corollary are fundamental in the study of linear 
prediction involving infinitely many variables. The results are implicit in much 
of the work on the subject but do not seem to have been stated explicitly before. 

Theorem 1.2. Let •••, xo, Xi ^ be a sequence of one-dimensional Gaussian 

* The processes need not even be temporally homogeneous. It is necessary only that 
j&{x(«)) and J57{x(«) x(« + 01 he independent of s. 
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chance variables vnth the property that if ni < • ■ • < rit, the muUieariate (attri¬ 
bution 0 / x„,, • • • , Xn, is Gaussian and that 

(1-2.1) E{-- - , x„] =*«* 

whenever m < n. Then E{xm\ = a is independent of m, and 

(1.2.2) ••• g £:{(*« - af\ g E{{x^+i - a)‘) g ■ • • . 

If the {x„} are defined for all negative integers, 

(1.2.3) iim Xm « x-« 

exists with probability 1 and 

(1.2.4) lim £?{(x_» - x„)*| = 0. 

m-^oo 

If the {xn} OLre defined for all 'positive integers^ and if the dispersions in (L2.2) 
form a hounded sequence, 

(1.2.3') lim Xyn == Xao 

7n-*oo 

exists with prohability 1, and 

(1.2.40 lim E{{x» — x„)0 = 0.^ 

»n-*«o 

It follows from (1.2.1) that 

(1.2.5) E{xn\ = E{E\xm',x.n\ \ =E{x„]. 

Hence • • • = £{xol = = • • • . It will be no restriction to assume from 

now on that 

••• = e{xo} = E\xx\ = - •• = 0. 

It also follows from (1.2.1) that 

(1.2.0) E{x,nXJi = E{E\x„, ■, X,nXj^\ = E{XmE{Xn,\Xn\] = E\xl,\.^ 

Using this equation, 

(1.2.7) E{x\] = E[[{x„ - x„) + xJO = E\{xn - a!«)'} + E[xl}, 

and the dispersions of the Xn thus form a monotone non-decreasing sequence. 

* The conditional expectation of a chance variable y for given values of a chance variable 
7] will be denoted by E{ri; y\, 

^ Much of this theorem remains true if (1.2.1) is true but only the first moments of the 
Xn are supposed finite, no other hypothesis being made on their distributions. Cf. Doob, 
Am, Math, Soc. Trans., Vol. 47 (1940), pp. 458-460. 

* Alternatively^ (1.2.1) implies that Xn — Xn is uncorrelated with a;». Then E[XmXn\ =* 
E{xl + (Xn - X„,)Xm] « Elx^]. 
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FinaUy, uting (15.6), 

(1.2.8) E[(x„+i - Xm)ixn+i — *»)}= 0. 

The aeries 

(1.2.9) Z(awi-*-) 

m 

is therefore a series of mutually independent chance variables. According to a 
well known theorem of Kolmogoroff, a sequence of mutually independent chance 
variables converges with probability 1 if the means and dispersions fonn a con¬ 
vergent series. The present theorem follows at once from Kolmogoroff’s 
theorem. 

Corollary. Lei x be a one-dimensiondl Gaussian chance variable and let 


( 1 . 2 . 10 ) 


) "^^^02 j * * * 
^11 ) ^12 ) * * ■ 
Xii , X22 , * * * 


be sequences of one-dhnensioncd Gaussian chance variables with the property that 
i^ v'^l^ the multivariate distribution 0 / x, , • • • , x^v is Gaussiarty and suppose 
that each variable Xmn is a member of every later sequence.. Then 


( 1 . 2 . 11 ) 


lim E{x„i,Xna, • • • ; .x} = x- 
«*-♦— «0 

lim E{xni,x,„ 2 , ••• ;a:} =*= 


exist with probability 1 , and in the mean, and 

(1.2.12) X 4 . = E{xmn , m = 0, =bl, • • • , n = 1 , 2, • • • ; x}. 

It will first be shown that the sequence where 

(1.2.13) Xm — L (Xmi y Xfn2 j ’ ' * i > 

has the property demanded in the theorem. In fact, from the definition of 
conditional expectation, the difference x — Xn has expectation zero and is in¬ 
dependent of the variables {x^^} for m g n, and therefore of the variables 
• • • , Xn~i, Xn . Hence 

(x Xn) ~ (x ““ Xn-f-l) ~ Xn+l ~~ Xn 

has expectation zero and is independent of the variables • • • , Xn-i, Xn . There¬ 
fore the sequence {Xn^i — Xn} is a sequence of mutually independent chance 
variables with vanishing expectations. This implies ( 1 . 2 . 1 ) if rri < n because 


E{ * * * > Xyn » Xn) 

(1.2.14) 


; aw + 2 (*/+» - a;y)| » J5{ • • • , aw ; aw} 

n -1 

+ • • • , a;« ; 1,4.1 - xy} = x« . 
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(1.2.16) £|®„ - a)’} + £((* *- a:,)*} - £{(* ^ o)*). : ’ «■. / S 

Hence the sequence of dispersitnis of the Xm is bounded and aoe6itding‘te'11ieor^ 
1.2 the limits z- and x+ in (1.2.1) exist with probability 1. Since a; ^ pSi, hiss 
expectation aero and is independent of Xms for m ^ n, x — x+ idso has eiqitectsi* 
tion aero and is independent of for all m, that is (1.2.12) is true. 

The simplest non-trivial special case of this theorem » the following: 
Let Xi,Xt, • • • yX be one-dimensional Gaussian chance variables with the proper^ 
that if V 1 the multivariate distribution of x,xi, “• ,x, is Gaussian. Then 

(1.2.16) lim E{xi , ■ • • , x„ ; *} = £{a:i, *», • • • ; x], 

n-*00 

vnth probability 1, and this limit is alsQ a limit in the mean. 

As stated, the theorem and corollary are true without the hypothesis tjiat Hie 
chance variables concerned ar^ Gaussian. (The existence of second moihents 
must be assumed if the Umits are to exist as limits in the mean.) They are 
stated for Gaussian variables because the proof is simple in that case, and be¬ 
cause that is sufficient for the purposes of this paper. 

In discussing t.h.G. processes whose parameter t is not restricted to be integral, 
the usual continuity hypothesis will be made. It will be supposed that R(t) 
is continuous at ^ = 0: , 

(1.3.1) lim [£(<) - fi(0)] - -pm E{[x(,t) - x(0)]‘} = 0. 

<-*0 

It is then easily concluded that R{t) is everywhere continuous. 

In the continuous parameter case, it would be useful to have the conditions 
on R{t) necessary and sufficient for the continuity in ^ of the chance variables 
x{t) and for the existence of the derivative. No set of necessary and sufficient 
conditions for the continuity of x(f) is known, although the fact of continuity 
will not be difficult to prove in the special cases to be considered in §4. The 
conditions for the existence of x^{t) are quite simple, and will be given in Theorem 
L4. 

The spectral function of a one-dimensional t.h.G. process will play an essential 
role in some of the theorems to be discussed below. If R(n) is the correlation 
function of a one-dimensional t.h.G. process, R(n) can be expressed in either 
of the forms 


R(n) 


^ cos nX dF(X) 


(1.3.2) 


n ~ 0, ±1, • • • , 



where, l’(X), called the spectral function of the process, and (r(X), call^ the 
comi^x spectral function of the process, are real monotone non-decreating 
functi(Hls satisf 3 d[ng the following conditions: 

F(0) = 0 . G(-r) =0 

(1.3.3) F(X-) = f(X), 0 < X < IT, (?(X-) = C?(X), -T < X < «■ 

G{\) - (?(0+) = (?(0) - (?(- X+) 

F{r) = (?(t) 

(1.3.4) F(X) = G(X) - Gi- X+) = 2G(\) - (?,(0) - (?(0+), 0 < X < x 
F'(X) = 2G'{\f 


The last equation of course holds only at points where the derivatives exist. The 
forms (1.3.2), (1.3.2') are derived trivially from each other. The correlation 
function determines the spectral functions uniquely, if the latter are supposed 
to satisfy (1.3.3). In fact, at the points of continuity of F(X), (?(X): 


^’(X) = 


+-t. R(n) 

T T 1 n 


(13.SI 


e(x) . + ' lim t B(n) i 

AV ZiTTl —r 

njdO 


in\ 

n 


Conversely if any F(X) or G(S) satisfying the stated conditions is used to deter¬ 
mine an R{n) by means of (1.3.2) or (1.3.2'), R{n) is the correlation function of a 
t.h.G. process. The representation of R{n) in terms of G(X) is frequently more 
convenient than that in terms of F(X), because of the simple properties of the 
exponential function. The following relation, which will be used below, exhibits 
the elegance attained by the use of 6(X): 


Cm»:(»»)]•£ c»a:(»)l} = Z) (Lc"R{m — n) 

m n m.n 

(1.3.6) _ 

= / (LcLe*’^)(£cU’'")dG(\). 


The correlation function of a one-dimensional continuous parameter t.h.G. 
process can be represented in either of the following forms: 


(1.3.7) 



cos t\ dF^X) 


(1.3.7') 


R(t) = dG(X) 


• H. Wold, A Study in the Analysie of Stationery Time Series, Uppsala, (1938), p. 66. 



vhxise thfi ^tral toctifla >0^ Him oatttirfjBi 4^ 

monotone non^ieci^i^. i^ satisfjr the (Kuiditfotor'' ‘ .^ 

',^’(0) = 0" ■", ,". ■ ^(-ooo = 0"' ■ 

(1.3.8) F(X-) = ^(X), 0<X<oe (?(X-) * <?(X) ' > 

(?(X) - (?(0+)> G(0) - G(-X-f) 

ii’(oo) = G(oo) 

(1.3.9) F(X) = G{\) - G{- X+) = 2G(X) ^ G(0) -'G(0+), 0 < X < «, ■ 

F'(X) = 2(?'(X). _ 

The last equation of course only holds at points where the derivatives exist. 
The correlation function R(t) determines tljie spectral functions uniquely if the 
latter are supposed to satisfy (1.3.8). In fact, at the points of continuity of 
F(K),G(\): 

F(\) = - I 12(0 — ^ 

TT Jo 


(1.3.10) 


t 


di 


(?(X) = Urn / R{t) dt + (?(0). 

r-*oo J-r 


t 


Theorem 1.4. Let {a;(/)} be the variables of a one-dimensional continuous 
'parameter t,KG, process with correlation function R{t) arid spectral function F{K). 

If 

(1.4.1) f X*dF(X) < =c 


(i) i2'(0» (0 are continuous^ and R'(0) = 0; 

(ii) x{t) is an absolutely continuous function of t, with probability 1; 

(iii) for each t, 


(1.4.2) 


lim = x'it) 

)»-*o n 


exists, with probability 1, and this convergence is also true in the mean: 
(1.4.3) lim - x'(<)J| = 0 ; 


(iv) the x'(0 process is a tXG. process^ with correlation function —i?"(0 and 
spectral function 
Conversely if 

(1.4.4) M < 

then (1.4.1) is true. 


[ XUFiX). 

Ja 
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This theorem is due to Slutsky. The proof will be sketched here, for oonx' 
pleteneas. (The hj^thesis that the process is a Gaussum process is immaterial, 
since only the second moments are invblved in the proof.) 

Proof of (»). If the integral (1.4.1) exists, /?'(<)» fl"(0 can be obtained by dif¬ 
ferentiating under the integral sign in (1.2.2): 


(1.4.5) 


ft'(<) = —j X sin (k dFQi) 
R'\t) =• —X*cosAdF(X). 


Then R'{t), R"it) are continuous functions, and J'?'(0) = 
Proof of (U), (Hi), (iv). The quantity 

^ 4 g) J!,j ]Xit + hi) - x(f) _ x(,t -h hi) - x(t) 




J} 


can be evaluated in terms of the coirelation function /?(<), and approaches 0 
with hi y hi y if the second derivative /2"(0 exists. There is therefore a chance 
variable y(t) to which the difference quotient converges in the mean; 


(1.4.7) 


lim - y(f) J| = 0. 


Jhe y{t) proceas is a t.h.G. process. Moreover the equation 
(1.4.8) E{x(8)x( 8 + ()]==: R{t) 


cap be differentiated to give 


(1.4.9) E{x{8)y{s + 0! = - t)y{8)} = R'{t) 

and this in turn when differentiated becomes 


(1.4.10) E{y{8 ~ t)yi8)} = E{y(8)y(s + 01 = 


Hence the y(t) process has correlation function —/?"(0. Finally, 
- a:(0) - f‘ yis) dsjj = E{[x{i) - ^(O)]*} 


(1.4.11) 


+ f‘ {j/(8)j/(«')} ds ds' -2 j* E{[xit) - a:(0)]j/(«)} ds = 0, 


(ev^uating the right side of (1.4.11) in terms of R{t);R'(f), R"(t)). Thusa:(<) 
is absolutely continuous, with probability 1, and y{t) is the derived fimction 
*'(<). Hence x'{t) exists for almost all t, with probability 1.*® It follows (Fu- 
bini's theorem) that the limit in (1.4.2) exists for each t, with probability 1, 
except possibly for a <-set of Lebesgue measure 0. Since the process is t.h., the 


For the exact meaning and meaaure-theoretic justification for statements of this type, 
see Doob, Am. Math. Soe. Trant., Vol. 42 (1937), pp. 107-40. 






exm^bnal set must be diber ^pty or the whole f-'lihe. The dtOi^ptiohaStliet^ 
is therdore «npty. 

Conversely if (1,4.4) is true, (1.4.1) follows at once from (1.3.)0- r 
It yiH be convenient to ute condensed notation below. If ® ; (%,•••, *»), 
V' t y^) erre AT-dimensuMial vectoa« and if A: (a<y) is an iVyimension^ 


^lare matrix, i-y will denote the matrix (a;d?y). Ax the vector with ccHJ^nents 

2 o<y*y a^d (x, y) the number x,^i . The adj<|int matrix (a*/): «<,• * dyy 

# - * ' ■ • , 
will be denoted by A*. Throughout this paper, the chance variables will be 

real-valued, but it will be convenient to use complex constant vectors. The 

identity matrix will be denoted by I. It will be convenient to denote the t, ith 

term of the matrix A by (A)*,-. The following equations will be used frequency: 


Ax-By = A(x-y)B*, {Ax, y) = (®, A*y). 

If x'is a chance variable, it is clear that E[x-x\ is a symmetric non-n^ative 
definite matrix. 

The simplest Gaussian processes are those in which the distribution of future 
states is based not on the complete past, but only on the immediate present. 
The precise definition of this (Markoff) property is the following. 

(C) If <1 < • • • <>.+1 the conditional distribution of a:(<,+i) for given values of 
x{ti), , x{t,) depends only on the value assigned to a:(<,). The conditional 
distribution of x{t,+i) for given values of x{ii), • • • , xiU) will then be simply ttie 
conditional distribution of x{ty+i) for the assigned value of x{t,). 

The, processes to be discussed in this paper are the processes with properties 
(A), (B), (C): temporally homogeneous Gaussian Markoff (t.h.G.M.) processes. 
The properties of t.h.G.M. processes will also be used to derive properties of the 
most important simple types of one-dimensional t.h.G. processes—those with 
rational spectral density functions. Some of the results are contained implicitly 
in the work of previous writers, but the presentation of the results has in all cases 
been chosen to stress their specific probability significance, and may therefore 
appeal even to readers familiar with previous work. 

The condition (C) on a Gaussian process is equivalent to the condition (C') 
that if <1 < • • • < t,+i 


(1.5.1) E\x{ti), ••• , x{ty)-, a:(Wi)l = E{x{t,); x{t,+i)]. 


In fact (C) is at least as strong as (O'). Conversely if (C') is true. 


( 1 . 6 . 2 ) 


x(i»+l) — 3;(i>4.l) E\x{tr)', x(i»+i)) -|- E{x{t,)‘, x(tr^.i)| 

= y -f E{x{t,y, x(Wi)}, 


where y is a Gaussian chance variable with mean 0 uncorrelated with and 
therefore independent of x(<i), • • • , x(<,), and the last term of (1.5.2) is simply a 
multiple of x{t,). Th^ the conditional distribution of x(fr+i) for given z(<i)) 
• • • , x{Q is a Gaussian variaUe, with mean £{x(f,); x(f,.(.j) j and disperrion that 
of y. Since this conditicmal distribution depends only tm x(<,), property (C') 
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implife property (C). Hfence these properties are equivalent* The condition 
(O') can be written in the form 

(1.5*3) E{x{t), r g s; a;(« + 0} = E{x{8); x(s + <)}» ^ > 0. 

In many applications the stochastic processes either have this property already 
or will have it if the dimensionality of the processes is increased by the adjunc¬ 
tion of auxiliary chance variables. In the latter case the process is called a 
component process of a t.h.G.M. process. Component processes are discussed 
in detail below. If a process is a t.h.G. process, the right side of (1.5.3) is a 
linear transformation (depending only on t) of x{s): 

(1.5.4) E{x{s);xis -f 0} = d(0x(s), t > 0. 

The matrix function A(t) will be called the transition matrix function. It 
satisfies the equation (obtained by performing the operation E{x{s)-] on both 
sides of (1.5.4)) 

(1.5.5) R(t) = R(Q)A(tr, t > 0, 

but is otherwise unrestricted since if (1.5.5) is true, the difference xit) — d(0:i^(s) 
is uncorrelated with and therefore independent of x(s). In many applications 
the elements of R{t) will vanish identically except in square matrices down the 
main diagonal. If this is true, ^(0 can also be assumed in this form. 

If the variables {.t( 0} determine an iV-dimensional t.h.G.M. process, and if B 
is a non-singular iV’-dimensional square matrix, the variables {Bx(t)} also de¬ 
termine a t.h.G.M. process. Two processes connected in this way will be called 
equivalent. If two t.h.G. processes are equivalent, and if one is a Markoff 
process, the other must be also. If there is a change of variable 

(1.5.6) y{t) = Exit) 

taking the t.h.G.M. x{t) process with transition matrix A{t) and correlation 
matrix R{t) into the equivalent y{t) process with transition matrix Ai{t) and 
correlation matrix R\{t), then 

(1.5.7) Ai{t) = Riit) = BR{t)B*. 

If {x(0}, {2/(<)l» { 2(01 determine t.h.G. processes of dimensions a, /S and 
a + respectively, if the process determined b}^ 

{ri(0, • • • , Xait)y yi{i)y • • • , yjiif)] 

is equivalent to the z{t) process, and if every x{8) is independent of every y{t)y 
the z{t) process will be called the direct product of the x{t) and y{t) processes. 
The extension of the definition to direct products of more than two processes 
is clear. If the a;(0 and y{t) processes are Markoff processes, their direct product 
is also a Markoff process. Conversely if the z{t) process is a Markoff process, 
the factor processes must also be Markoff processes. The following facts about 
matrices will be used below. If A is any iVT-dimensional matrix, there is a non- 
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singulM N'KlimeBsidnal matrix B'moh BT'^AS jus i& Joi^ti oaB<na»»d 
the elementB of vaoish except for those m ced»in subnrntri*^ downi^O^^^ 

main diagonal. Each of these submatrices has the form ; . 

( X 0 • • • 

^ ^ ® ; ! 

0 • • 0 1 X/ 

or simply (X) if it is one-dimensional. The X’s are the characteristic values of 4, 

that is tile roots of the characteristic equation det. \ A — X/ ( = 0, and the sum 
of the dimensions of the submatrices with a given X is the multiplicity of X as a 
root of this equation. The matrix A is said to’have simple elementary divisors 
corresponding to a given root X of the characteristic equation if the submatrieeis 
in (1.5.7) with that X are all of dimension 1. Thus orthogobal matrices, sym¬ 
metric inatrices, and skew symmetric matrices have only simple elementary 
divisors, since they can be put in diagonal form, (with X’s of modulus 1, real, 
pure imaginarj^, respectively). The transformation B and the X's may not be 
real. If A is real, however, there is a real matrix B such that the elements of 
B~^AB vanish except for square submatrices down the main diagonal, and the 
characteristic roots of different submatrices are neither equal nor conjugate 
imaginary. 

The powers of a matrix in .Jordan canonical form are easily calculated using 
the fact that 


(1.5.9) 



0 . . . 

0 \’‘ 

/ X” 0 • 

• \ 


X 0 • • 


nX"-' X” 0 

: M 

\o 

•01 

J 


■ nX”-‘ X”/ 


It follows that in the general case the elements (A")<j are linear combinations 
of \j , nX," , etc., where Xi, Xj, • • • are the characteristic values of A. Hence 
if (A")i, —> 0 as n —> », the approach must be exponential. The terms of A" 
certainly go to 0 if all the characteristic values of A have modulus less than 1. 

The matrix is defined by the usual series formula for the exponential func¬ 
tion. If A has the form (1.5.8), can bO calculated using (1.5.9): 


(1.5.10) 


I 


0 


te^ s'" 


It follows that in the general case the elements are linear combinaticms of 
, etc. where Xi, X*, • • • are the characteristic values of A. If («*■*)</ —»0 
ast—* 00 the approach must be exponential. The terms of certainly go to 0 
if all the characteristic values of A have negative real parts. 




240 


l. U DOOB 


A t.h.G*M. prooeas will be calted detennini^ if the least squares predictioa 
of x(« + t) for given (t > 0 ), that ib E{x(8); x(8 + 0} is always correct: 

(1.6.1) x(8 + 0 * A(t)x(8) t > 0, 

with probability 1 . 

The following classification of deterministic processes will be useful later. It 
will be shown that any t.h.G.M. process is the direct product of processes of the 
following four deterministic types, and of a factor process with no deterministic 
factors. 

ilf(0). Let {x(0] be the variables of a one-dimensional t.h.G.M. process^ 
with x(t) = 0 with probability 1. (The chance variable which is 0 with prob¬ 
ability 1 is considered as a Gaussian variable with mean 0 and dispersion 0.) 
The correlation function of the process vanishes identically. 

Jlf(l). A one-dimensional t.h.G.M. process which satisfies 

(1.6.2) x(t) = x(0), E{x(t)l = 0 , E{x(ty} > 0, 

will be called a process of type M(l), The correlation function R(t) is positive 
and independent of L 

ilf(—1). A one-dimensional t.h.G.M. process with an integral-valued param¬ 
eter n, satisfying 

(1.6.3) • • • = a;(-l) = -x(0) = a:(l) = • • • Eix(n)} = 0, E{x(nfi > 0 

will be called a process of type Af(—1). The correlation function is alternately 
positive and negative: R(n) = (—l)"i 2 ( 0 ). 

A two-dimensional t.h.G.M. process with 

E{xm\ = 0, £:{a:,(0)*} = <r“ > 0, .B{xx(0)a:,(0)) = 0, 


(1.6.4) Xi(t) = a:i(0) cos t0 — Xi(0) sin t0 

Xi(t) — a:i(0) sin t0 -f- a:j(0) cos tO, 

will be called a process of type M{e'*). The correlation function is given by 


( 1 . 6 . 6 ) 


m) 


( ' 2 


cos tB 


(T^ Sin 


in 

os to) 


sin to a’ cos i 

A process with variables {x(0} will be called degenerate if there are constants 
Cl, • • • , Cat not all 0, such that 

(1.7.1) Z - 0 


with probability 1, for all L Equation (1.7.1) is true if and only if 
(1.7.2) ' Cixs(t)f] Z (R(0)h,CiC, = 0 

3 It* 

that is if and only if the correlation matrix iZ(0) is singular. If a non-degenerate 
process is a direct product of factor processes, the latter are also non-degenerate. 
The only degenerate one-dimensional process is that of type ilf(O). 
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2. The etructur e of defomiate determiiditic piooMiee 

Thsobbm 2.1. Every degenerate t.h.O.M. proceee is the direct ptadtud cf 
eesee c/ type MtfSi. and (in some eases) of a non-degenerate tJi.0.if. proeetfs. 

In proving this theorem, it can be supposed that the original process has been 
replaced by an equivalent process, if necessary, so that the symmetric non- 
negative definite matrix R(0) is in diagonal form, with only O’s and I’s down tbe 
main diagonal, say 0 to the rth place and 1 thereafter. Then Xj(t) =? 0, wlm 
j ^ V and the process is obviously the direct product of v processes Of type M(0) 
and an (N — i')-dimen8ional non-degenerate profile. 

Thbobeh 2.2. Let {x(0l be the variables of a deterministic tXOM. process, 
•with correlation function R(t). 

(i) The process is the direct produd of factor processes of types M(0), M(±l),: 
M(e\ 

(ii) If the parameter t of the process is restricted to the integers^ there is a rum- 
singular matrix A such that 

( 2 . 2 . 1 ) • x(n) = A’‘a;( 0 ), 

(2.2.2) R(n) = R{0)A*^, w = 0, =bl, ±2, • • • 

(2.2.3) 22(0) = AR(0)A*. 

The transition matrix A is the transform B0E~^ of an orthogonal matrix 0. If 
the process is non-degenerate, A is uniquely determined, 

(ii') If the parameter of the process runs through all real numbers, there is a 
matrix Q such that 

(2.2.1') x(t) = e'MO), 

— 00 < < < 00 

(2.2.2') R(t) - R(0)e^^ , 

(2.2.3') QR{0) + RiO)Q* = 0. 

The matrix Q is the transform BK B~^ of a skew symmetric matrix K. If the process 
is rum-degenerate, Q is uniquely determined. 

(iii) Conversely if 22(0) is any symmetric non-negative definite matrix, and 
if i4.(Q) is any matrix satisfying (2.2.3) ((2.2.3')), where A is non-singular, there 
is a deterministic t.h.G.M. process with correlation function given by (2.2.2) ((2.2.2')) 
and satisfying ( 2 . 2 . 1 ) (( 2 . 2 . 1 ')). 

In proving (i) (ii) and (ii') it will be permissible to go to processes equivalent 
to the original one, if convenient. Moreover if the given process can be expressed 
as a direct product, it will be sufficient to prove (i) (ii) and (ii') for each factor. 
Since (i) (ii) and (ii') are certainly true for processes of type 21/(0) (with A in 
(ii) the identity, and Q in (ii') the null matrix), and since according to Theorem 
2 . 1 , processes of type 2 lf( 0 ) can be factored out of the given process to leave a 
non-d^enerate remaining factor, if any, it will be sufficient to prove (i) (ii) and 
(ii') for non-degenerate processes. 
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Proof {i integral) of (i) and {ii) for rwn-degenerate processes. If the process 
determined by {a:(n)} is deterministic, (1.6.1) is true. Hence 

(2.2.4) x{v + 1) = Ax(v). 

Then (2.2.1) is true for n ^ 0, and will be established for all n as soon jus it is 
shown that A is non-singular. Using (2.2.1), 

(2.2.5) R{n) - E{x(0)-x(n)\ = i;(,T(0)*A"a;(0)} = i(!(0)A*^ n ^ 0, 
and 

(2.2.6) RiO) = /^{:r(l)•a:(l)} = £;{Aa;(0)• A.t( 0)} - Ai2(0)A*. 

Under the present hypotheses, J?(0) is non-singular. Then A is determined 
uniquely by (2.2.5) with n = 1, and A cannot be singular because of (2.2.6). 
There is an equivalent process in which R{0) is the identity. .Considering this 
process, (2.2.6) becomes I = AA*, so that A is orthogonal. Finally there is an 
equivalent process (obtained by an orthogonal (;hange of variables) in which 
/2(0) is still the identity and the matrix A is in the (real) normal form of or¬ 
thogonal matrices: all the elements of A are 0 except for two-dimensional rotation 
matrices or Fs or — Fs down the main diagonal. It is now obvious that the 
process is the direct product of processes of types ilf(±l), 

Proof {t continuously varying) of {i) and (if') for non-degenerate processes. If 
the t.h.G.M. process determined by {^(O} i^ deterministic, (1.6.1) is true. 
Hence 

(2.2.5') R{t) = i;{x(60-x(.s- + 0} = R{0)A{t)* 

(2.2.6') R{0) = E{x{s + t)^x{s + t)] = A{t)R{0)A{t)*. 

The matrix A(0 is uniquel}" determined l)y (2.2.5') since R{0) is non-singular. 
It then follows from (1.6.1) that 

(2.2.7) A{s + 0 = Ais)Ait). 

The continuity hypothesis (1,3.1) becomes 

(2.2.8) lim/e(0)A(0* = ii!(0), 

<-♦0 

which implies that 

(2.2.9) lim A(<) = I. 

It is well known that any solution of (2.2.7) and (2.2.9) can be written ih the 
form A{t) == e*^. If now the right side of (2.2.6') is expanded in powers of t 
and the coefficient of t is set equal to 0, the resulting equation is (2,2.3'). It can 
be supposed, going to an equivalent process if necessary, that R{0) is the identity. 
Then A(t)A{t)* = /, Q + 0* = 0. An equivalent process can be chosen so 
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that R(0) is still the id^tity, so that Q is in the md (sBacaucal lorm of jifecw. 
symmetric matrices; its elements vanish except for possible two rowed matnoes 

down the main diagonal. It is now clear that the non-degenerate process is a 
direct product of factors of type M(e'*). correspondmg to two row^ rnattfces 
just described, and factors of type Af(l). 

Proof of (m). If f2(0) and A(Q) satisfy the conditions of Theorem 1.2 (iii), 
choose *(0) as any Gaussian variable with correlation matrix /?(0). Then 
define x(n) by (2.2.1) ((2.2.1')). The resulting stochastic process is tmpor^y 
homogeneous if and only if E[x(s) •x(« -f t)} depends ohly on t. The details of 
the calculation will be carried out for only for i integral. 

In the first place 

(2.2.10) E{xin)-xin -f- i;)} - £{4’‘x(0)-A’‘+'a:(0)} = A”ii:(0)4*"+'. 

Now (2.2.3) can be developed further: 

(2.2.11) R{0) = ARiO)A* = A^R(0)A*^ = • • • 

SO that (2.2.10) reduces to 

(2.2.12) E[x{n)-x{n + r)} = /^(0)/i*^ 

The process is thus temporarally homogeneous, and obviously satisfies the other 
parts of the definition of a deterministic t.h.G.M. process. Theorem 2.2 is now 
completely proved. 

The restriction imposed on i?(0), A{Q) by (2.2.3) ((2.2.3')) is quite loose. 
Given 72(0), there is alwaj^s an A{Q) satisfying (2.2.3) ((2.2.3')) for example the 
identity (null matrix). Given an A which is the transform of an orthogonal 
matrix (a Q which is the transform of a skew symmetric matrix) there is always 
a corresponding 72(0): In fact il(Q) can be assumed to be orthogonal (skew sym¬ 
metric) and the 72(0) can be taken as the identity. 

3. TJI.G.M. processes with an integral valued parameter 

In this section, the parameter t will range through all the integers. The condi¬ 
tion (1.5.3) that a t.h.G. process be a t.h.G.M. process can be simplified in the 
integral parameter case. In fact it will be shown that it is sufficient if 

(3.1.1) E{ • • • , x(n - 1), x{n); x{n + 1)1 = E{x{n); x(n +1)}, 

n = 0, ±1, • • • 

with probability 1. If (3.1.1) is true, 

(3.1.2) x{v + 1) = Ax{v) + riiv) 
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where A is the transition matrix of the process and niv) has mean 0 and is inde¬ 
pendent of • • • , x(v — 1), x{v). It follows that 

x{n) = A^-^xim) + A""”''n(m) + A""”‘“%(m + 1) 

-h ••* + r,(n - 1). 

The terms involving the i;0) ^11 independent of • • • , x{in — 1), x{in). This 

equation therefore implies that 

(3.1.4) E{ • • • , x{m — 1), x(m); x(n)} = = E{x{m); x{n)\y 

and (3.1.4) is precisely the condition that the process has the Markoff property. 
The following lemma will be useful. 

Lemma 3.2. Let x:{xi, • • • , xi^) be any Gaussian chance variable, with £J{x} 
» 0. Then there is a uniquely determined symmetric non-negative definite matrix 
S, and a Gaussian chance variable y, such that 

(3.2.1) E{yy] - I 
and 

( 3 . 2 . 2 ) x^Stj, g'^E{x^x]. 

If X = Sy, and if S is symmetric, then the second equation in (3.2.2) is cer¬ 
tainly true. It is easily seen, by examination of the characteristic values and 
vectors of the matrix E[x-x] that this matrix has a unique symmetric non¬ 
negative definite square root S, Hence if there is an S satisfying (3.2.2), there 
can be only one. The chance variables Xi, • • • , Xat can be written as linear 
combinations of N uncorrelated Gaussian chance variables , • • • , {at satisfying 
=7: 

(3.2.3) X = A^. 

If A is written in the polar form A = SU, where S is symmetric and non-nega¬ 
tive definite and U is orthogonal, (3.2.3) becomes 

(3.2.4) X = SU^ - Sy 
where y ^ satisfies (3.2.1). 

It will be shown below that every t.h.G.M. process can be represented as the 
direct product of factors of certain types. The deterministic types have already 
been catalogued: ilf(O), Af(±:l), M(e*^), The non-deterministic factor type 
(integral valued parameter) will now be described. 

M, Let {riin)] be a sequence of mutually independent i\r-dimensional 
Gaussian chance variables with 0 means and a common distribution function. 
Let A be any iV-dimensional square matrix. Define x(n) by 


(3.3.1) 


x(n) * A'^riin — m) 
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where it is supposed tbat'^ A is so chosen that the series converges with jcH<c^ 
ability 1. This will be true, for exfuuple, .if all the characteristic values of 4 
have modulus less than 1, so that the terms of A* go exponentially to 0.“ It 
will be shown below that it is no restriction to assume that A has this character, 
The variables {x(n)l determine a t.h.G.M. process. Since a;(n)-A»(n — 1 ) 
is independent of • • • , x(n — 2), x(h — 1), the x{n) process is a Markoff process 
with transition matrix A: 

(3.3.2) E{ ••• ,x(n — l);x(n)} = 4a:(n — 1). 

A process defined in this way will be called a process of type M. A non-singular 
(fiiange of variables y{n) ^ J5x(n) leads to a process of the same type: 

(3.3.3) y{n) - £ {BABT^TB-nin - m). 

It will sometimes be convenient to write a process of type M in a form slightly 
different from (3.3.1). Using Lemma 3.2 it is evident that there are Gaussian 
variables {{(n)j satisfying 

(3.3.4) E{}i{n)\ - 0, £U(m).f(n)} = m, n = 0, ±1, • • • , 

and a symmetric non-negative definite matrix S such that riin) = /Sf(n). Then 
and 

00 

(3.3.5) x(n) = X) A"‘Sf(n - m). 

Under the change of variable y{n) = Bx{n), A l)ecomes BAB~^ and becomes 
BS^B^, 

The only condition on A required for convergence in (3.3.5) is that A'^S --> 0. 
It will now be shown that A can always be assumed to have only characteristic 
values of modulus less than 1, in the sense that there is an A with this property, 
and satisfying the equations 

(3.3.6) A'lS = m = 1, 2, • • • . 


It is no restriction, going to an equivalent process if necessary, to assume that 
the elements of A vanish except for those in two square submatrices down the 
main diagonal, where one submatrix Ai has only characteristic values of modulus 
less than 1 and the other, A ^, of modulus greater than or equal to 1. If the 
matrix S is written in terms of the corresponding subinatrices: 


(3.3.7) 



“ Throughout this piper, if A is any matrix, A® is defined as the identity matrix /, 

We shall use repeatedly Kolmogoroff’s theorem that an infinite series of mutually inde* 
pendent chance variables with zero means converges with probability 1 if the series of their 
dispersion is convergent. (Kolmogoroff only states the theorem in one dimension, but the 
extension to n dimension is trivial.) If a series of mutually independent Gaussian variables 
converges, the series of dispersions converges to the dispersion of the sum. 
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the condition on A implies that A 2 S 2 0. If it is shown thAt S 2 « 0, it will 
follow that Sa « 54 = 0, because 8 is S 3 rmmetric and non-negative definite. The 
matrix A will then be defined by 


(3.3.8) 



and A will satisfy (3.3.6) and will have only characteristic values of modulus 
less than 1. The problem has thus been reduced to the case where Ai is absent: 
A only has characteristic values of modulus at least 1, and it must be proved 
that 0 implies that <S = 0. The proof of this is immediate when A is put 

into its Jordan canonical form. 

The symmetric non-negative definite matrix S satisfies the equations 


(3.3.9) R(P) = E ^ + AR(0)A*. 

0 


Theorem 3.4. A direct product of processes of type M is also of type ikf. Con- 
versely any factor process of a process of type M is itself of type M, 

The direct part of the theorem is obvious. To prove the converse, suppose 
an AT-dimensional process of type M has an i-dimensional factor, corresponding 
to the variables xi{n), • • • , x/(n). It can be supposed, that all factors of type 
M{0) are separated out, so that there are indices j, k: 1 ^ j ^ I ^ k < N such 
that the { 0:1 (n), ••• , Xj{n)} and {xk+i{n), ••• , XAr(n)} processes are non-de¬ 
generate and that the variables Xj^i{n)y • • • , Xk{n) vanish identically. Making 
a change of variables, if necessary, it can be supposed that R(0) has the form 


(3.4.1) 



k — j 
0 
0 
0 



and that R{n) has the blocks of zeros indicated in (3.4.1), Since J?(l) == ft(0)ri’' 
A must have the form 


'j k - j N - k\ 

(3.4.2) .4:1 , Z 0 ) 

Then A" will have this .same form. Finally, because of (3.3.9), 5^, and therefore 
S must have the form 

/- 0 0 \ 

(3.4.3) 5*:( 0 0 0 1 

\0 0 -/ 

Let i4o be the matrix whose elements are the same as those of A except that the 
(j + l)th to Ath columns of A^ vanish. Since 

(3.4.4) ATS = A”S, m = 0, 1, • • • 



it follows that 
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(3.4.5) x(n) = .AySKn — m). 

)}|m0 

It is now obvious that the {xi(n), • • • , Xi ( n )] process is of type M. 

Theorem 3.6. A non-degenerate process of type M has no deterministic factors. 
Any factor process is non-degenerate and of type M, To prove the theorem, 
it will therefore be sufficient to prove that the process itseh cannot be deter¬ 
ministic;. If it were, we should have ^ 

x(n) — Ax(n — 1) =» S^(n — 1) = 0. 

Then = 0. But then the process is certainly degenerate, contrary to hy¬ 
pothesis. - 

Theorem 3.6. (i) Every i.h.GM, process {discrete parameter) is the direct 
product of processes of type M(0), M(±l), M. 

(ii) Let A he a transition matrix of a th,G,M, process^ with variables {a;(n)}. 
There are mutually independent Gaussian variables • • • , f(0), {(1), • * * , J satis- 
flying 

SffCn)} = = 0 

^U(w)-Kn)) =«{«•«}=/, 

and symmetric non-negative definite matrices T such that 
(3.6.2) x(n) = E A"‘Si(n - m) + A" T(, n = 0, 1, • • • , 

m—0 

(3.0.3) = Ar A*, 

(3.6.4) fi(0) = Z = AR(0)A* + S\ 


where the series in (3.6.2) converges with probability 1. If A is non-singular, (3.6.2) 
holds for all n. The sum and last term in (3.6.2) are linear transformations of x{n ): 
(3,6.2) exhibits in part the decomposition into factor processes described in (i). The 
correlation function is given by 


(3.6.5) 


R{n) = R{0)A*" 

n = 0, 1, • • • . 

ft(~n) - A^R{0). 


(iii) The transition matrix A is uniquely determined if and only if the process 
is non-degenerate. In any case, there is a transition matrix whose characteristic 
values are all of modulus less than or equal to 1, and whose characteristic values of 
modulus 1 correspond to simple elementary divisors. The transition matrix A 
furnishes the solution of the prediction problem of the process: 

(3.6.6) E{ — , x{m — 1), x(m); x{m -j- n)} = A'^xim), 


n = 0, 1, * • • . 
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The matrix 5*, which is uniquely determined^ measures the dispersion of x{n + 1) 
about its predicted value in terms of x{n ): 

(3.6.7) E\[x{n + 1) - Axin)f} = S\ 

(iv) Conversely if A is a matrix with at least one characteristic value of modulus 
less than lor of modulus 1 and corresponding to a simple elementary divisor, A is the 
transition matrix of a Uh.G.M, process, with R{0) not the null matrix. If all the 
characteristic values of A are as just described, A is the transition matrix of a nour 
degenerate t.h.G.M. process. If R{0), S, A are matrices satisfying (3,6.4) with 
12(0), S, symmetric and nonr-negative definite, there is a t.h.G.M. process whose 
variables can be written in the form (3.6.2) u)ith the given 12(0), S, A. 

This decomposition of a t.h.G.M. process into deterministic factors can be 
considered as a special case of the general decomposition theorem of Wold, which 
is applicable to all t.h.G. process,^® (Wold only considered the one-dimensional 
case.) The proof in the present special case is simpler, however, and illumi¬ 
nates the general case. 

Proof of (i) and (ii). Equations (1.5.3) and (1.5.4), in the present case, 
lead to 

(3.6.8) E[ • • • , x(n - 1), x(n); x{n + 1)1 = E[x(n); x(n + 1)} = Axin). 

The first two terms are equal because the process has the Markoff property. 
The last term is linear in x{n) because the process is Gaussian. The matrix A 
can be taken independent of n because the process is temporally homogeneous. 
Thus (3.6.8) involves the three fundamental properties of the x{n) process. 
From the definition of conditional expectation, it follows that x(n + 1) — Ax(n) 
is independent of the chance variables • • • , x{n — 1), x(n). Hence the variables 

• • • , [x(n) - Ax(n - 1)], [x{n + 1) - Ax{n)], • • • 

are mutually independent. According to Lemma 3.2, there are mutually inde¬ 
pendent chance variables {{(n)} satisfying 

(3.6.9) x(n) - Ax{n - 1) = Sli{n), £?{{(n)-f(n)} = /, E{ii{n)] = 0, 

where S is synunetric, non-negative definite, and satisfies (3.6.7), The matrix 
thus measures the dispersion of x(n) about its predicted value Ax{n — 1). 

The representation (3.6.2) can be obtained very simply. In fact 

x(n) = [x{n) — Ax{n — 1)] + Alx(n — 1) — Ax(n — 2)] -f • • • 

(3 g jOj + [xiv + .1) - Ax{y)] + A"-"'x(v) 

i-0 

A Study in the Analysis of Stationery Time Series, Uppsala (1938), p. 89. See also 
Kolmogoroff, Bull. Acad. Sci. URSS Ser. Math., Vol. 5 (1941), pp. 3~14 and Bolletin Afos- 
kovskogo Oosudarstvenogo, Mathematika, Vol. 2 (1941), pp. 1-40), in whose papers the de¬ 
composition theorem is brought out in its full significance. 
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and it will be shown that when> ^ — « (3.6.1<1) leads to (3.6.2), Befoi^ gi^Dt 
to the limit, however, we note that in (3.6.10) the sum is indepeodhnt.<3|11ie 
variables • v, a5(v—1), a:(i>), so that 

(3.6.11) * E{ ,x{p-l),x(v);x(n)] ^ A'^'xiv) 

which is another way of writmg (3.6.6). Moreover, uang (3.6.11), 

(3.6.12) R(n - y) = E{x(y) -x(n)} = It(0)A*’^\ 

which is another way of writing (3.6.5). (The value of R(n) for n < 0 is ob¬ 
tained using the fact that R(—n) = R(n)*.) The last term in (3.6.10) is the 
conditional expectation of x{n) for preassigned , x(y —1), x(y). It follows 
from the corollary to Theorem 1.2 that this conditional expectation conveiges 
with probability 1 when —» — «, but this convergence will be proved directly 
in tile present particular case. 

From (3.6.10), 

(3.6.13) E{x(n)-x(n)} = R(0) = "5* A^S^A*'' + A""R^A*"-'. 

;-0 

/ 

The terms of the sum and the last term are all symmetric and non-negative 
definite matrices. It follows that there is convergence in (3.6.13) when v —► 
— 00 : 

(3.6.14) Rm = E A’E‘A*^ + lim A’“ft(0)A*’". 

jmmO W-eoP 

The convergence of the series of dispersions in (3.6.14) implies that the series of 
chance variables in (3.6.2) converges, with probability 1. Then when s —> — w 
(3.6.10) becomes 

(3.6.16) x(n) = E -d'Sf(n — j) + s(n), 

#-o 

where 

(3.6.16) z(n) «■ lim j4"~’x(i<). 

Since x(n) is independent of f(n + 1), $(« + 2), • • • , 2(n) is independent of every 
{(m). Moreover, writing z(0) = T^, where f satisfies (3.6.1) and T is symmetric 
and non-negative definite, 

(3.6.17) 2 (n) = A\(f}) = n S 0. 

Thus (3.6.3) and (3.6.4) are satisfied. If A is non-singular, (3.6.17) will be cor¬ 
rect for negative n also. 

The decomposition of the proc^ into factor processes of the types described 
in the theorem will be obtained by a detailed analysis of the significance of 
(3.6.2). Under the change of variable y{n) = Bx(n), T® becomes BT^B*, and 
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A becomes BABT^. Making a suitable change of variables, if neceisaiy, it can 
be supposed that A has the form 


(3.6.18)’ 



where the characteristic values of Ai have moduli unequal to 1 and those Of At 
have modulus 1. The matrix T* can be written in terms of feubmatrices of the 
same dimensions in a corresponding way: 


(3.6.19) 



where Ti, Tt are symmetric and non-negative deHnite. A further change of 
variables may be made, if necessary (transforming only the last n variables) pre¬ 
serving the forms (3.6.18) and (3.6.19) and transforming Tt into the identity. 
Then using (3.6.3) 


(3.6.20) AiTlAt = Tl, AtAt = /. 


Hence At is orthogonal. Developing (3.6.20) further, AiT\Ai'” = T\, for all 
m S 0. When m oo in this equation, the terms in the matrix product on the 
left involve the mth power of the characteristic values of Ai (all of modulus 
different from 1, by hypothesis). Then those characteristic values which actually 
appear can only be those of modulus less than 1, and the matrix on the left must 
go to 0 as m 00 : Tl = 0. Since T is non-negative definite, T must have the 
.simple form 


(3.6.21) 



The matrix S can also be divided into corresponding submatrices: 

(3.6.22) -). 

The convergence of the series in (3.6.4) implies that 

lim A? Si A?”* = 0. 

m-^oo 

Since Ai is orthogonal, this means that S 2 = 0, and since S is symmetric and non¬ 
negative definite, S has the form 

(3.6.23) S:(®' 2)- 

It is now clear from (3.6.2) that the x(n) process is the direct product of a process 
of type M and a deterministic process corresponding to the division of the vari¬ 
ables determining the above submatrices. The deterministic factor process is 
the direct product of the elementary types already discussed. The variable 
s(n) and the sum in (3.6.2) are linear transformations of x(n). 
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Proof of (tit). If the proeees is non-degeii^te, ^(0) ia 
transition nmtrix is determined Uniquely by (3.6.5) mth n 1. II the pqoess 
is degenerate, there will be one or more factor processes of type M(0), apd th^ 
transition matrices are quite unrestricted. In the nonrdegenerate case the 
characteristic values will be of modulus less than 1 (corresponding to a factor nf 
type M, if one is present), or equal to 1 (corresponding to the factors of type M 
(d=l), M(e'*) making up the deterministic factor, if one is present), and.in the 
latter case the elementary divisors are simple. If the process is degenerate, and 
if the part of A corresponding to the factors of type M(0) is taken to be the iden¬ 
tity, there will be simple elementary divisors corresponding to the characteristic 
value 1 for each such factor. The remaining statements of (iii) have already 
been proved. 

Proof of (iv). Let A be a matrix with at least one characteristic value of 
modulus less than 1 or equal to 1 and corresponding to a simple elementary 
divisor. Then some transform BAB~^ has the form 

/At 0 0\ 

(3.6.24) ( 0 At 0 ) 

\ 0 0 At/ 

where At (if present) has only characteristic values of modulus less than I, A* 
(if present) is orthogonal, and both At , At are not absent. For the purposes 
of the present proof it can be supposed that A is already in this form. Define 
S, Thy 

fSi 0 0\ /O 0 0\ 

(3.6.25) 6 ’;{ 0 0 o) T:(o / o) 

\0 0 0/ \0 0 0/ 

where the indicated submatrices of S and T are in the same positions as those of 
A, and where St is any symmetric positive definite (and therefore non-singular) 
matrix of the proper dimension. The series in (3.6.4) converges and the first 
equation in (3.6.4) defines a matrix iJ(0) which obviously satisfies the continued 
equality. If all the characteristic values of A are as described in the beginning 
of this paragraph, As can be supposed absent. In this case 

'R(0) = S* + r" -h • • • 

is non-singular. The proof of the first two parts of (iv) has now been reduced 
to that of the last part. Suppose then that R(0), A, S satisfy the hypotheses 
of the last part of (iv). Then 

R(0) = Afi:(0)A* -h 

AR(0)A* = A*i2(0)A** + A5*A* 

A"~‘iZ(0)A*""‘ = A"fl(0)A*" -t- A""‘<S“A*"-‘. 


.(3.6.26) 
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Adding theeB equations 

(3.6.27) R(0) * 2 A”<S®A*" + A’'R(0)A*\ 

Thifi equation leads to (3.6.14), and T\ defined as the limit in (3.6.14), satisfies 

(3.6.3). Let • • • , {(—I), {(0), * • • , f be mutually independent Gaussian vari¬ 
ables satisfying (3.6.1). Then the x(n) defined by (3.6.2) determine the vari- 
abtes of a Gaussian process with non-negative values of n, but a slight modifica¬ 
tion is needed to obtain an expression defined for all n. To obtain this, it can 
be supposed that A, T, S are in the forms (3.6.24), (3.6.25). Define A by 

(3.6.?8) [ 

Then A is orthogonal and AT — AT, If now (3.6.2) is used to define x(n) for 
all n with A'^T replaced by A'^T, the x{n) process is a t.h.G.M. process with the 
desired properties. 

The properties of the prociess reversed in time are of some interest. It is easy 
to see that if n is replaced by —n, a t.h.G.M. process remains a t.h.G.M. process. 
If the original process is non-degenerate, the new transition matrix is 
/2(0)A*-ff(0)“"^ If the transition matrix remains unchanged when n is replaced 
by —n, /2(0)i4*i2(0)~^ = A, This is equivalent to the equation R{n) — R(—n), 
The simplest generalization of a t.h.G.M. process is the following. Let the 
chance variables {y{n)] determine a t.h.G. process with the property that for 
some iV > 0, 

(3.7.1) E\ ,y{n - 1); y{n)\ = E{y{n - AT), • • • , y(:n - 1); y{n)\, 

with probability 1. If iV = 1, the process is a t.h.G.M. process. This type 
process will be called a t.h.G.Mjv. process. To avoid notational complications 
only the one-dimensional case will be considered. The right hand side of (3.7.1) 
is a linear combination of the variables y{n — N), • • • , y{n — 1). The variables 
thus satisfy a difference equation of the form 

(3.7.2) y[n) - aiy{n - l) - ... - asyin - N) = 7i(n) 

generalizing (3.6.9), where rj{n) is independent of the chance variables • • • , 
2 /(n — 2), y{n — 1). The are mutually independent chance variables 

with zero means and dispersions independent of n. Equation (3.7.2) leads to 

(3.7.3) y(n) - - 1)- - N) = (n) 

(m ^ n) 

where ”*^(n) has zero mean and is independent of the chance variables • • • , 
y{m — 2), y{m — 1). Hence 

(3.7.4) E{ yVim- 1); y{n)] = E{y{m - N), ,y{m - 1); y{n)], 

, m S n. 
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The difference eqiiatitm (3.7.9) has been stiuiy^ in eonw detaff in the part/* 
We shall use an appro^ which adda insist into the structure of the feolutlcia 
and which clarifiee the place of the solution in the general theory of tJsLQ. 
processes. This approach is in terms of AT-dimentional t.h.G.M. processes. 
Define the variables {x(n)} of an W-dimensional process by 

(3.7.6) x/(n) = t/(n + j), n = 0, ±1, • • • , i * 1, • • • , JV. 


The x(n) process is evidently a t.h.G.M. process. If the index N of the p(n) 
process is the minimum for which (3.7.1) is true, the corresponding x(n) process 
will be non-degenerate. Then the transition matrix A is uniquely determined, 
and is evidently 


(3.7.6) 



1 0 • 

0 1 0 


. 0 

. 0 


0 1 


ffll/ 


Oi ja 0. 


The matrix S, measuring the dispersion of the prediction Ax(n — 1) of x(n), 
has the form 


(3.7.7) 


/O 


\o 


• 0 

0 8 , 


The characteristic equation of A is simply 

(3.7.8) a" - - • • • - o,r = 0. 


The matrix A has only a single characteristic vector corresponding to each char¬ 
acteristic value X, the vector (1, X, • • • , X*“‘). Hence if X is a multiple root of 
(3.7.8), it does not correspond to a simple elementary divisor. Therefore, ac¬ 
cording to Theorem 3.6, all roots of (3.7.8) of modulus 1 must be simple roots. 
It will be proved below that either no roots have modulus 1 or all roots have 
modulus 1. 

If an A^-dimensional non-degenerate t.h.G.M. process is given whose transition 
matrix A and dispersion matrix 5 have the form§ (3.7.6) and (3.7.7) respectivriy, 


Xy(n) — Xj+i{n — 1) = 0 

with probability 1, for j < N. Then a y(n) process can be defined unambigu¬ 
ously by (3.7.5). Since for fixed j, Xj(n) determines a one-dimensional t.h.G. 
process, the y(n) process is a t.h.G. process, and (3.7.1) is obviously true, with 
N minimal if A is non-singular. 

Case 1. 8 = 0 {deterministic case). In this case the x(n) process is deter¬ 
ministic, and the y{n) process satisfies the equation 

(3.7.2') y{n) = a^{n - 1) 4- • • • + any{n - N). 


Cf. for examine H. Wold, A Study in the Analysie Of Statumery Time Series, Uppsala, 

1988. 
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AH the roots of (3.7.8) are simple roots, of modulus 1. Siuce 8 =* 0, 

(3.7.9) x(n) = A'Ti n«0, ±1, • •• 

and therefore 

(3.7.10) v{n) = x,(n - 1) = Z (A”"* n = 0, ±1 • • • . 


Using either the well known form of the solution of the iVth order difference 
equation (3.7.2') or of the powers of an orthogonal matrix, it follows that 

N 

(3.7.11) y(n) = ^ {rjj cos nOj + fy sin nSj) 

7-*l 


where the rij and f y are (one-dimensional) Gaussian variables, and 

{cos $j + i sin ^y} 

are the N distinct characteristic values of A, that is the roots of (3.7.8). 

Case £. S 9 ^ 0 {non-deterministie case). In this case it will now be shown that 
the x{n) process can have no deterministic factors: that is that the roots of 
(3.7.8) all have modulus less than 1. In fact let be a root of (3.7.8), corre¬ 
sponding to the (diaracteristic vector z of A*: 


(3.7.12) 




Then using (3.6,4), 

(72(0)2, z) - {AR(fl)A*z, z) + {S\ z) 

(3.7.13) = (/i:(0)A*2, A*z) + (&, Sz) 

= \ff\\R{0)z,z)+\^\pr. 

Hence 1 13 | cannot be 1, and the x{n) process can have no deterministic factors. 
Equation (3.6.2) becomes 

(3.7.14) x{n) == A"/S{(n — m) 


which leads to 

(3.7.16) y(n) =»X Z (A”S)iyW» - m - 1) = s Z (A")iyfy(n - m - 1). 

jmmi m«0 m«0 

According to Theorem 3.6 the only restriction on the (ioefiicients ai, • • • , 
in the two cases S = 0, S 0, are respectively that equation (3.7.8) has N 
distinct roots of modulus 1 and all roots of modulus less than 1. Hence (3.7.10) 
and (3.7.15) furnish (with the stated restrictions on A) the most general 
t.h.G.M;^. processes. 
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It was shown in ThocHrem 3^6 that i£ JS(n) is the ccnrektion functim o^ 
t.h.G.M. process, R(n) can be expr^sed in the form (8.6,6), where A is some 
suitably chosen matrix. Conversely if the correlation function of a t.h.G. 
process has the form (3.6.6), the process is a t.h.G.M, process since x(n + 1) — 
Ax{n) is then orthogonal to (and therefore indepeiuient of) the vmiables 
• • • , x{n — 1), x{n), (This fact implies the truth of (3.1.1)). The character¬ 
ization of t.h.G.M. processes in terms of their correlation functions is thus 
easily solved. The following theorcjms characterize one-dimensional t.h.G.Mir. 
processes from various points of view. It will be convenient, and also intrin¬ 
sically interesting to treat at the same time a slightly larger class of processes: 
the class of component processes of processes. A one-dimensional t.h.G. 

process with variables {xi(n )} will be called a component process of an JV-dimen- 
sional t.h.G.M. process if there are iV* — 1 t.h.G. processes with variabltss 
{a^(n)}, ••• , {a;jv(w)} such that the iV-dunensional process with variableB 
{a;i(n), • • • , x^in)] is a t.h.G.M. process. If the variables {x(n)} detehnine 
an iV-dimensional t.h.G.M. process, the t.h.G. processes determined by 
{xi(n)}, ••• , {a:jyr(w)} will be called its N component processes. If an x(l%) 
process is not of type M(0) and is a component process of an JV-dimensional 
t.h.G.M. process, it is a component process of a non-degenerate iVi-dimensional 
t.h.G.M. process, for some Ni ^ N. It has already been seen that one-dimen¬ 
sional t.h.G.Miv. processes are component processes of iV-dimensional t.h.G.M. 
processes. 

Theorem 3.8. Let • • • , x(0), a;(l), • • • he the variables of a one-dimensional 
t.h.G. process. The process is a component process of an N-dimensional t.h.GM. 
process if and only if the chance variables 

(3.8.1) x(0), £?{••• ,x(-l),a:(0);x(n)j, u = l, 2, ••• 

are linearly dependent on the first N. 

Suppose that the x(n) process is a component process of an )\r-dimensioDal 
y{n) process: x(n) = with correlation function JS,(n) and transition matrix 
A. Since A satisfies ife characteristic equation 

(3.8.2) det \otI — A \ = o’ — — • • • — Ojv = 0, 

it follows from (3.6.2) that if ri{n + N) is defined by 

(3.8.3) yin + N) - a^in + iV - 1) - • • • - a^in) = ii(n + N) 

then ij(n + N) is independent of • • • , yin — 1), yin). Then 

(3.8.4) x(n + N) — aixin + N — i) — ••• — aj,xin) » ruin + N) 

where iii(n + AT) is independent of the chance variables • • •» x(n — 1), x(n). 
Equation (3.8.4) leads to 

(3.8.5) x(n + JV + v) — oi'’x(n + JV — 1) 


a^Mn)^ni’\n + N + p) 
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where ni’^n + N + v) has zero meiui and is indepeiuilent of the chance variables 
••• , x(n — 1), x{n). If the operator 

a:(n - 1), a:(n); •) 

is applied to this equation, the result is 


(3.8.6) 


E\ ■ • , x(n — 1), x(n); x(n + N + v)} 

- £ J?{ • • • , a:(n - 1), a:(n); a;(n + JV - w)} 

w—•! 


0 . 


The last term in the sum is x(n) and (3.8.6) is thus the desired linear relation. 

Conversely suppose that the (AT + l)th chance variable in (3.8.1) is linearly 
dependent on the first N: 

£{■■■ ,x(n - 1), x(n);x(n + N)} 

(3.8.7) , ^ 

a„El - ■ ■ ,x(n — l')x(n);x(n + iV — m)l, n = 0, ±1, 

•1 

Define the variables i/t(n), • • • , yff(n) of an A’-dimensional t.h.G. process by 
j/i(n) = x(n) 

(3.8.8) = E{-- - ,x(n - 1), x(n); x(n + 1) 
j/j»(n) - El - ■ • , x(n — 1), x(n); x(n + N - 1). 

Then 

El---,y(n - 1), y(n); yi(n + 1)} = yi{n) 

(3.8.9) E{--- ,y{n - 1), y(n); j/w_i(n + 1)) = j/w(n) 

..., y(n - 1), y(n); j/Ar(n + 1) j 

S 

= E[--- ,x{n - 1), x{n)]x{n + iV)} = (uys+i^in). 

ftlmml 

The y{n) process is therefore a t.h.G.M. process, with transition matrix (3.7.6), 
and the x{n) process is a component process. 

The following particular type of t.h.G. process will be involved in the proof 
of Theorem 3.9. If the chance variables {yjin)} determine a t.h.G. process 
whose correlation function i2,(n) vanishes when n ^ iV, then according to 
(1.3.5) the complex spectral function Cri,(X) of the rj{n) process is continuous, 
with derivative (?J(X) given by 

g;(x) - i e «.(«). 


(3.9.1) 
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2^ 

It 18 easily verified (using the fact iMt lZ;(n) » R^(—n) » that U k iB » 

root of riie equation • 

(3.9.2) E fi,(n)*" =0, 

then a, 1/a, 1/a are also roots, of the same multiplicity. Moreover if | « | » 
1, a is a root of even multiplicity, since the sum in (3.9.2) is real and non-negative 
when I « .| = 1. When | 2 | == 1, 

(3.9.3) I a 1 I (2 — a)(z — 1/a) | = [ 2 — a p. 

Hence <?i(X) can be written in the following simple form: 

(3.9.4) , (?;(X) = I + • - • + b^.i I* 

where the roots of the indicated polynomial have modulus at most 1, and the 
coefficients are real. 

Theorem 3.9. Lei • • • , x(0), a;(l), • • * be the variables of a (me^imenaioTial 
Lh,G. process. The process is a component process of a finite-dimmsional Lh,GM, 
process if and only if the complex spectral function is the sum of the integral of 
the square of the absolute vahie of a rational function of with real coefficients^ 
and of a monotone non-decreasing function increasing only in a finite number of 
jumps.^^ Specifically: 

(i) The process is a component process of an N-dimensional t.h,G,M, process 
if and only if the complex spectral function has the form 

(3.9.5) G(X) - [ 1?^'.’:^^-'-^^’ dX + (5(i) 

1 aoC -f- . . . -f- 1 

where 

(a) Cr{X) is a monotone non-decreasing function satisfying (1.3.3), increasing 
only by jumps, at no more than N points] 

(b) the denominator of the integrand vanishes at every discontinuity of 0{\), 
and the numerator vanishes at every zero of the denominator, to at least the 
same order] 

(c) the coefficients oo, • • • , ajv, jSo, * • • , are real, oo 9^ 0, 9 ^ 0 unless 

the integrand vanishes identically, and the roots of the polynomials in the 
integrand have modulus less than or equal to 1. 

The integral vanishes identically if and only if the x{n) process is a component process 
of an N-dimensional deterministic process, and (j{\) vanishes identically if and only 
if the variables x{n) vanish identically or the x{n) process is a component process of 
an N-dimensional t,h,G,M, process with no deterministic factor. 

(ii) The process is a t.h.G.Ms. process {deterministic case) if and only if the 

It is easily seen that the first term of the two can also be described simply as the in¬ 
tegral of a rational function of e*\ which is non-negative for X real and is an even function 
of X like all complex spectral density functions. 
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complex function (t(X) == (5(X) is a monotone non-decreadng function eatie* 

fying (1.3.3) increasing only in jumpsy at no more than N points] (non-deterministic 
case) if and only if the complex spectral function has the form 


(3.9.6) 



dh 

iN\ I I 

aoe + • • • + ttiv 


where oo, • • • , afr are real and oto 9^ 0. 

Proof of (i). If the x{n) process is a one-dimensional component of an iST- 
dimensional t.h.G.M. process, it has already been seen that for properly chosen 
real numbers ai, • • • , , (3.8.4) is true, where 77i(n + iV') is independent of 

the chance variables • • •, x(n — 1), x(n). Equation (3.8.2) can be assumed to 
have all its roots of modulus less than or equal to 1. It follows from (3.8.4) 
that T?i(n) is independent of -iji(m) if j n — m 1 N. The complex spectral 
function of the i;i(n) process is therefore continuous, with derivative given by 
(3.9.4). It will be no restriction to assume that ho 9 ^ 0 unless the derivative 
vanishes identically. According to (1.3.6), if G(\) is the complex spectral func¬ 
tion of the x(n) process. 


(3.9.7) 






£ 


,«nX I JNX ^ ^ |2 jri/\ \ 

J \e — Uie — • • • — aj\r I aC:r(X). 


Hence if Q{\) is the jump function of 0(\) = 0, and ^(X) is constant 

except for jumps at the same points as those of G{\), and of the same magnitude), 


(3.9.8) 


I ... ^ |2 ^ 

= £ 1 -1* dlGCX) - 5(X)] 


+ 


£ 


JNX 


as 




Since the first two integrals are continuous in X, the last must be continuous also. 
Hence the last integrand must vanish at every discontinuity of 5'(X), that is at 
every discontinuity of (?(X), and the last integral vanishes identically. It 
follows that 

(3.9.9) 0(X) - a(X) - [ I M"^+-+9.-.f ^ 

rf-T \ e — • • • — av I 


where the numerator vanishes at each zero of the denominator, with the same or 
greater multiplicity. Since the denominator vanishes at each discontinuity 
<rf (?(X), there can be at most N discontinuities. If the iV-dimensional process 
is a deterministic process, it can be assumed that all the roots of equation (3.8.2) 
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haye modulus 1, that is that the denominator and hence also i^e numerator in 

(3.9.9) have iV roots. This otm be true only if ilie numerator vanishes identir 
cally: 0(\) ss ^(x). If the i\r.<limen8ional process has no deterministic imUx, it 
can be assumed that all the roots of equation (3.8.2) have modulus less than 1. 
Then G'(X) can have no discontinuities: Cf(K) s 0. 

Conversely if G(X) has the form described in Theorem 3.9 (i), (?(X) can be 
assumed in the form (3.9.9) with real coefficients in numerator and denominator 
and the stated relations between the jumps of G(\) and the zeros of the numa'ator 
and denominator in the integrand. (If the integrand vanishes identically and 
if G(X) has N discontinuities, oi, • • • , Ow can be chosen as thase numbers making 
the polynomial 


vanish at the discontinuities of G{\).) Then 

R{n + N} — <i\R(n N — 1) — • • • — <iyR(n) 

_ f’' <x(n+i) [6o + • • • + bs-ie*^^ ^ ... 

(3.9.10) i, " 1 - aie‘>‘- 

+ - - a^]d&(\). 

The last integral vanishes since the bracket vanishes at every jump of ^(X). 
The denominator in the first integrand is the value on | 2 | = 1 of a polynomial 
all of whose roots are outside \z \ == 1, or on | | = 1. Any zero on [ z | =* 1 

corresponds to one of the numerator at the same point. The integral therefore 
vanishes if n ^ 0 (Cauchy Integral Theorem): 

(3.9.11) R{n + N) — aiR{n + — l) ~ — a^Riji) = 0, w ^ 0. 

This equation implies that 

(3.9.12) x{n + A) — aix{n + iV — 1) — • • • — asxin) 

is independent of the chance variables • • • , x{n — 1), x{n), that is that (3.8.4) 
is true, where rii{n + N) has the stated properties. It has already been seen 
in the proof of Theorem 3.8 that this implies (3.8.6) and that this in turn implies 
that the process is a component process of an iV^-dimensional t.h.G.M. process 
whose transition matrix A has characteristic equation (3.8.2). In particular if 
(j(X) ^ fi'(X), the roots of the characteristic equation are of modulus 1, so that 
the ATniimensional process must be deterministic. If ^(X) = 0, the x(n) process 
is a component process of att i^-dimensional process whose transition matrix A 
has only characteristic values of modulus less than 1. This AT-dimensional 
process can have no deterministic factors other than one or more of type Af(0). 
If these exist, (and if the x{n) process is not of type M(0)) they can be replaced 
by non-degenerate factors of type Af, to obtain an iV-dimensional process with 
no deterministic factor, having the x(n) process as a component process. 
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Proof of (it). If the x{n) process is a t-h.G.Mj^. process, (3.8.4) is true with 
iyi(m) indepaident of iyi(n) if m n. The discussion in (i) is therrfcwe simplified 
by the fact that the numerator in (3.9.9) is constant. If this constant is 0, 
the spectral function is a function of jumps: G(\) * fi'(X). If this constant is 
not 0, the denominator in (3.9.9) does not vanish, and &{\) therefore vanishes 
identically. The converse is proved as in (i). 

Thbobbm 3.10. (i) // Oi, • • • , dff are real numbers^ there is a ono-dirnmaional 
Lh.G. process not of type M(0) with correlation function R{n) satisfying 


(3.10.1) R{n + iV) - a,ft(n + ~ 1) - • * * ~ asR{n) = 0 

for n Q if and only if the equation 

( 3 . 10 . 2 ) ^ --= 0 


has at least one root of modulus less than or equal to 1. 

Let • • • , x(0), x(l), • • • he the variables of a one-dimensional t,h»G. process 
not of type ilf(O). 

(ii) This process is a component process of an N-dimensional Lh.GM. process if 
and only if the correlation function R{n) satisfies an Nth order linear difference 
equation (3.10.1) for n ^ 0. 

(iii) The process is a tJi.GMn- process if and only if the difference equation 

(3.10.1) is true for n ^ — (iV -- 1). In this case the vectors (x(n), ••• , 
x(n + iV — 1)} determine an N-dimensional tJi,GM, process. 

(iv) Equation (3.10.1) is satisfied for n ^ if and only if 

(3.10.3) x{n + N) — aix(n + iV — 1) ~ • — a^rxin) =0, n = 0, ±1, • • • . 

Proof of (ii), (iii), (it;). Let the x(n) process be a component process of an 
^^--dimensional t.h.G.M. y(n) process with correlation function Ry{n) : x{n) = 
yi{n), and transition matrix A. Since A satisfies its characteristic equation 
(3.8.2), it follows from (3.6.5) that 

(3.10.4) Ry(n + N) — aiRy(n + AT — 1). — • • • — anRyin) =0, n ^ 0. 

Then R(ri) — (/2y(n))u satisfies this same difference equation. Conversely if 

(3.10.1) is true for n ^ 0, it has already been proved in the course of the proof of 
Theorem 3.9 that the x(n) process is a component process of an iV-dimensional 
t.h.G.M. process. This finishes the proof of (ii). Parts (iii) and (iv) are proved 
similarly. 

Proof of (t). According to (ii), if there is a one-dimensional t.h.G. process 
whose correlation function R(n) satisfies (3.10.1) for n ^ 0, the process is a com¬ 
ponent process of an AT-dimensional t.h.G.M. process whose transition matrix 
A (3.10.2) as characteristic equation. Since A has at least one characteristic 
value of modulus less than or equal to 1, (unless the x(n) process is of type 
ilf(0)), (3.10.2) must have at least one root of modulus less than or equal to 1. 
Conversely if (3.10.2) has at least one such root, there is a real iV-dimensional 
matrix A whose characteristic equation is (3.10.2), and which has simple ele- 
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mentary divisors. According to Tbeorem 3.6 (ii), A is then the trao^tion 
of some t.h.G.M. process. The oorrelaticm function of this process and h^ioe 
that of each component process satisfies (3.10.1) for n g 0. 

Theorem 3.11. (i) Z/ Oi, • • • > are real numbersy there is a (me-^imenHofuU 
LKG. 'process not of type M{0) satisfying 

(3.11.1) xin + N) — aix{n + JV — 1) — • • • — aitx(n) * i;(n + AT), 

n =s 0, =fcl, • • • 

vhth flint), f)in) independent for I m — n \ ^ N if and only if (3.10.2) has ai least 
one root of modulus less than or equal to 
Let • • • , a:(0), a;(l), • • • he the variables of a one-ditnensional Lh.G, process. 

(ii) This process is a component process of an N-dimensional Lh.G.M. process 
if and only if (3.11.1) is true vhth riim), i|(n) independent /or | m — n | ^ iV. 
In this case rj(n + N) mil be independent of the chance variables • • • , xin — 1), 
xin). 

(iii) The process is a t.h.G.M. process if and only if in addition to the condition 
in iii), riin) is independent of the chance variables • * • , xin — 2), x(n — 1); de- 
terministic case if riin) = 0 vhth probability 1, nondeterministic case othervhse. 

Since this theorem follows readily from the preceding theorems, the proof will 
be omitted. 

The problem of predicting xin) in terms of • • • , xin — 2), xin — 1) is trivial 
(theoretically at least) for t.h.G.Miv. processes. In fact these were defined as 
those processes for which the solution of the prediction problem is simply a 

linear combination ^ Ujxin — j) of the preceding N variables. The solution 

will now be given for the more general class of component processes of A'-dimen- 

sional t.h.G.M. processes, processes which have been described from several 

points of view in the preceding theorems. 

The prediction problem for component processes of iV-dimensional t.h.G.M, 

processes will be put into a more general setting. If the one-dimensional chance 

variables {x(n)) determine a t.h.G. process, with correlation function Rin), 

the problem of finding £{ • • • , xin — 2), xin 1); x(n)} is that of finding a 
00 

series ^ymxin — m)^® such that 

m-il 

00 

(3.12.1) xin) - 23 ymxin - m) 

m«l 

is imcorrelated with every xin — r) (r > 0): 

(3.12.2) Riv) — ymRiy — »t) = 0, r > 0. 

mmrnl 


We are neglecting all convergence difficultiee. They become trivial for tho appHca- 
tigga to be made below. 
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If the complex spectral function is (?(X), (3.12.2) becomes 


(3.125) 


[[ e"'|l - E dGi\) = 0, 


0. 


Let C?(X) be the integral of its derivative (?'(X), that is let (7(X) be absolutely con¬ 
tinuous. According to (3.12.3) the problem reduces to that of finding a function 


(3.12.4) 




2 — e 


.A 


such that/(«)(?' is of power series type, a function corresponding to an expansion 
in non-negative powers of z. The dispersion of the error of the prediction is 

x(n) - 53 ymx(n - m)l) = f dG(\) 

tnmml J J tl-T 

f"i/rd<?(x). 


E 


(3.12.5) 


{[ 


In particular if the x(n) process is a component process of an iST-dimensional 
t.h.G.M. process, (?(X) is given by (3.9.6). It will be supposed throughout the 
following that d = 0. Then 


(3.12.6) 


0'<\) a= g(A)Z^ ^ + • • • + 0K-l)iffo + • • • + *) 

(aiZ*^ + • • • -f* aw)(ao + • • • + ast") 


z = e 




In this case / s 1 if G's 0, and otherwise / is given by 

ffo(otoz" + • • • + an) 


(3.12.7) 
so that 

(3.12.8) 


/(*) = 


aoz(/3o2^“* + • •' + ffn-i) 


!/(*) r = 


alG' 


The dispersion of the prediction error is jS( 0) if 6's 0 and otherwise is 2ir0l/<4 • 
The prediction formula for x(n) in terms of the variables • • • , x(n — v — 1), 
x(n — y) has now been derived for v = 1, for the chance variables under discus¬ 
sion in this section. The solution for general v is easily obtained. 

As I' —+ <», the prediction converges with probability 1, according to the 
corollary to Theorem 1.2. If the process is a component process of an N- 
dimensional t.h.G.M. process, and if ^ b 0 in (3.9.5), the limit is 0. That is, 
in this case, the best pre<iicted value of x(n) in terms of the distant past is near 
£{*(n)) = 0, the same predicted value which would be used with no knowledge 
of the past. 
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4. RroceMM wbose pmoneter i vaxies oeotiiuiMUi^ 

Tlie basic process in terms of which t.h.G.M. processes without'determiiuStic 
factors were expressed in section 3 was a process whose variables {{(n)) werO 
Gaussian, with 

(4.1.1) ^{f(n)l = 0, i?{f(m)*{(n)l = S„.J. 


Hie corresponding proce^ in the continuous parameter case is not obtained by 
replacing the integral parameters m, n in (4.1.1) by continuous parameters, In 
fact the process so defined does not satisfy any useful continuity conditions. 
In the present discussion, sums like X) ^m£(m) will be replaced by Stieltjes 

int^rals j A{t)d4(t), and df(t) thus will correspond to f(n). The {(0 process 
is defined as follows. For any h < • • • < L, the chance variables 


$(f*) “■ f(fl)l • • • . i(0 — f(fn-l) 

are mutually independent JST-dimensional Gaussian chance variables, and if 
a <t, 


( 4 . 1 . 2 ) Em - 1 ( 8 )} = 0 , E[m) - m-m) - mw = « - «)/. 
« 

This process, called simply a {-process below has been discussed in great detiul 
by Bachelier, Wiener and L4vy. The function {(<), considered as a function of t 
is known to be continuous with probability l.^’^ The derivative {'(f) does not 
exist, since {[{<(< + h) — {<(<)]*} is proportional to h, whereas this mean would 
be proportional to if {'(<) existed. In fact it has been shown that {(<) is 
(with probability 1) not even of bounded variation in any finite interval. How¬ 
ever, if /(<) is a function defined and continuous for o ^ f ^ & (where o or 6 or 
both may be infinite), the integral 

(4.1.3) 


/ /(«) m 


can be defined as the limit in the mean of the usual Stieltjes sum. If J{t) has a 
continuous derivative, the integral in (4.1.3) can be evaluated by inte^ation 
by parts: 

( 4 . 1 . 4 ) jfV(o diit) = mm - mm - [ mrxt) dt. 


Integrals of the following type will be used below: 

y (0 “= f /(< “ t) d{(T) = /(0){(0 — /(t — a)m 

•'a 

+ jf - t) dr 


(4.1.6) 


” Paley and Wiener, “Fourier transforms in the complex domain,” Am. AfcdA. 8oe. 
CoUoq. Pvb., Vol. 19, p. 148. 
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m 

where f(t) is continuous and has two continuous derivatives. It is then evident 
that ylt) is continuous, but that y'{t) exists if and only if /(O) = 0. If /(O) «■ 0, 
y'{t) is given by 

(4.1.6) y'{t) - fnt - r) dt(T). 

Ja 

A more general process will also come into the discussion below, and will be 
called a f-process. The chance variables {r(0} of a f-process are Gaussian, and 
have the same independence property as the variables of a {-process. The 
second equation of (4.1.2) is dropped, so that (4.1.2) is replaced by 

(4.1.7) Em - r(o)} - 0, E{m ^ mhm - mh - m), 

where the symmetric and non-negative definite matrix D(t) will sometimes be 
supposed to have special properties, such as continuity in t, etc. The inde¬ 
pendence property of the f-process implies that 

(4.1.8) Eim - Us)Hm - fWll = D(t) - D(s). 

Hence D\t) (if this derivative exists) is symmetric and non-negative definite. 

Theorem 4.1, If the dispersion matrix D(t) of a ^-process is continuoits, the 
functions {f(0} ore continuous in t, with probability 1. 

The component processes of a f-process with a continuous dispersion function 
are also f-processes with continuous dispersion functions. Hence it will be 
sufficient to prove the theorem in the one-dimensional case. In this case D{t) 
is non-negative and monotone non-decreasing, according to (4.1.7) and (4.1.8). 
It can be supposed that D(t) does not vanish identically. Let Di{t) be an inverse 
function of D(t): D[Di{t)] == t. Then {(0 = f[Di(0] defines a f-process, and the 
continuity of f(0 implies that of f(<). 

The integrals of type (4.1.3) are defined for f-processes as for f-procasses, and 
satisfy the equations 

git) df (o| = 

A(t) dUt)f' B{t) dr(t)| = £ Ait)D'{t)Bit)* dt, 

where /, g are numerically valued functions and A, B are matrix functions.^* 
The f-processes lie at the basis of t.h.G. processes. To every t.h,G. process 
(discrete parameter) with variables {x(n)} correspond two one-dimensional 
f-processes with variables jfi(0}, {fsCO} such that 

(4.1.10) x(n) » cos nX dfi(X) + sin nX df 2 (X) 

These equations are easily proved using the fact that each integral can be approxi- > 
mated by the usual Riemann-Stieltjes sums. 


fmg{t)D\t) dt 


( 4 . 1 . 9 ) Eiffit)dm-f 
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wixete the two f-proc^eases are mutually mdepeudent in the sense that eveiy 
ft(Xi) is independent of every ri(Xft) and where, if 0(X) is the complex speotk^ 
function of the process, 

(4.1.11) E{U\f]^0{\). 

In the continuous parameter case (4.1.11) becomes 

(4.1.12) x(() XX £ cos tXdtiW + sin tXdt^iX). “ 

This theorem of Cram6r shows that x(n), or x{i) as the case may be, is the limit 
of a sum of sines and . cosines, with Gaussian chance variables as coefficients. 
The dispersion of each coefficient, which measures the intensity of the cor¬ 
responding periodic term of the sum, is determined by the spectral function of 
the process. In particular, if the spectral function F(\) is the integral of its 
derivative F'(X), each integrand involving dft(X) in the above equations can be 
replaced by one involving \/F\\)clii(X) where {<(X) is the variable of a {-process. 
Thus in many important cases the processes can be written in a simple way in 
terms of {-processes. 

It will be shown below that every t.h.G.M. process can be represented as the 
direct product of factors of certain types. The deterministic types have already 
been catalogued: M(0), Af(l), Af(6*^). The standard non-deterministic type, 
as in the discrete parameter case, will be called type M, 

M, Let {{(0) be the chance variables of an iV-dimensional {-process, as de¬ 
scribed above. I-«et Q be an iV-dimensional square matrix, and let S be an iV- 
dimensional symmetric non-negative definite matrix. Define x{t) by 

(4.2.1) xit) = jJ* e‘^Sm - s) = d^ir) 

where it is supposed tliat the improper integrals converge with probability 1. 
There will be convergence, for example, if Q has only characteristic values with 
negative real parts so that the elements in the matrix go to 0 exponentially 
as s —> 00 . (Cf. section 1.) It will be shown below that it is no restriction to 
assume that Q has this character. The x{t) process is evidently a t.h.G. process. 
It u < ty the chance variable 

(4.2.2) x(t) — x(w) « j e“'®Sd{(«) 

is independent of x{v) for i> g u, since x(v) is expressed in terms of {(») for 8 ^ v. 
Therefore the x{t) process is a Markoff process with transition matrix A(() = 

(4.2.3) E{x(v)y V ^ u; a;(0) = u < L 

“ H. Cramer, Arkiv Far Matemaiiky Astronomi ock FyHky Vol. 28B, No. 12, pp. 1-17. 
Cramer only .discusses the continuous parameter case, but the other requires no change of 
method. He allows complex-valued ^processes, in terms of which (4.1.10) and (4.1.12) 
asinime a more elegant form. 



3. L. DOOB 


A process defined in this way will be called a process of type M. A (diange of 
variable y(t) <= Bx(t) leads to a process of the same t}^: 

(4.2.4) yit) = BS di(t - s). 

The matrix Q goes into BQBT^ and if SiO is the polar form of 55, where S is 
symmetric and non-negative definite and 0 is orthogonal, S goes into Si. (We 
are using the fact that Ot(0 defines a second {-process.) The correlation func¬ 
tion of a process of type M is easily calculated: 

(4.2.5) 5(0) = j[ ft(<) = i2(0)e‘«*. 

The only condition imposed on Q is that the improper integrals in (4.2.1) con¬ 
verge. This condition is easily seen to be equivalent to the convergence of the 
integral in (4.2.5). This in turn is equivalent to the condition that 

(4.2.6) lime'^S = 0. 


This condition is certainly satisfied if the characteristic values of Q all have 
negative real parts, and it can always be assumed that this is so. (Cf. the cor- 
i^ponding discussion of processes of type M in the discrete parameter case.) 

The analogues in the continuous parameter case of Theorems 3.4 and 3.5 are 
true. The proofs are substantially the same as the proofs in the discrete param¬ 
eter case, and will be omitted. 

Theorem 4,3. (i) Every Lh,G,M» process {continuous parameter) is the direct 

product of processes of type Af(0), ilf(1), Af(e*®), ilf. 

(ii) If x{t) areihe variables of such a process, there is a matrix Q such that A{t) = 
>e^^ is a transition matrix function. There is a ^process, a Gaussian variable f, 
independent of the ((t), satisfying 

(4.3.1) 5{{} = 0, = / 


and symmetric non-negative definite mxUrices S, T such that 

i(() - r - «) + e"r{ 


r «'■ 


■*'®Sdf(s) + 

Qr* + = 0, 

ft(0) 




Sdiis) + e‘«a;(0), 


(4.3.2) 

(4.3.3) 

(4.3.4) 

(4.3.5) 

tpliere Ihe integrals in (4.3.2) converge with ■probability 1. The integral and the 
last term in each pair in (4.3i2) are linear transformations of x{t ): (4.3.2) exhibits 


ds + r 


QB(0) + R(0)(r - -S®, 
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in paH the ikcompoeition into factor processes described in (i). The coinretioHm 
function is given by . 


(4.8.6) 


R{t) = R(0)e'^- 

i ^ 0 . 

R{-t) ^ e'^R{0) 


(iii) The matrix Q is uniquely determined if and only if the process is non- 
degenerate, In any case there is a Q whose characteristic values all have negative 
or zero reed parts and whose characteristic values with zero real parts corresp^ to 
simple elementary divisors. The matrix Q furnishes the solution to the prediction 
problem of the process*. 


(4.3.7) E[x(8)y 8 S t; x{t + 7i)} == u > 0. 


The matrix Sy which is uniquely determinedy measures the dispersion of x(t) about 
its predicted value: 

(4.3.8) E[[x(t + u) - - uS^ (u~^0). 

(iv) Conversely if Q is a matrix with at least one characteristic value with negative 
real part or with zero real part and corresponding to a simple elementary divisory 
is the transition matrix function of a t,h,G.M. process with R{0) not the null 
matrix. If all the characteristic values of Q are as just describedy e*^ is the irarm- 
tion matrix function of a non-degenerate t.h,G.M, process. If -B(O), Sy Q are 
matrices satisfying (4.3.5) with i2(0), S symmetric and non-negative definitey there 
is a t.h.G.M. process whose variables can be written in the form (4.3.2) with the 
given i2(0), S, Q. 

The proof of Theorem 4.3 follows closely that of Theorem 3.6, and the details 
will not be given, except as they differ from those of the earlier proof. 

Proof of (i). Suppose that the {x(0} are the variables of a t.h.G.M. process 
which is non-degenerate. The transition matrix function A(t) is then uniquely 
determined by (1.5.5). Take the conditional expectation of both sides of 
(1.5.4) for given x{0): 


(4.3.9) 

i4(s + t)x{0) = i4 (<)^ (s)x(O) 

8,t>0. 

Since the 

process is non-degenerate, 


(4.3.10) 

A(s -f- <) = A(8)A(<) 

»,t> 0. 


According to (1.3.1) and (1.5.5) 

(4.3.11) lim fi(0 =» lim R{0)A{t)* = i2(0), f > 0. 

1-^ <-*o 

Hence 

(4.3.12) lim Ait) == I. 
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It has already been noted that any solution to (4.3.10) under the continuity 
hypothesis (4.3.12) can be written in the form 

(4,3.13) A(t) - 

where 


Q 


lim 

<-•0 


A(t) - / 

i 


Under a change of variables y(t) = Bx(t)j A{t) becomes BA{t)B^^ and Q becomes 
BQB'~\ According to Theorem 2,1, if the x(0 process is degenerate, it is the 
direct product of one or more factors of type ikf(O) and (perhaps) of a non¬ 
degenerate factor. The matrix Q of a factor of type Jlf(O) can be taken as the 
null matrix. Then the form (4.3.13) is admissible for any t.h.G.M. process, 
although Q will only be uniquely determined if the process is non-degenerate. 
Define f (0 by 


(4.3.14) f(0 = A(tr^x(t) = e~‘®x(0. 


Then if Si < < S 2 < fe 


( 4 . 3 . 16 ) Eim - f(80i = 0, Emk) - - r(«i)]} = o 

and 


(4.3.16) D(t) - Eim - f(0)]-ff(0 - f(0)]} - - ft(0). 

Hence the {{'(01 determine a f-process, with dispersion matrix given by (4,3.16). 
The derivative D\t) is easily evaluated; 

(4.3.17) D'{t) = e-*^[-R(0)Q* - Qft(0))e““'*. 

Since D'(0 is symmetric and non-negative definite, the bracket also has this 
property, and there is a non-singular matrix Si such that 

(4.3.18) Si[-Qfi(0) - R(0)Q^]St - U, 

where U is in diagonal form, with only O's and Ts in the main diagonal. Then 
the integral 

(4.3.19) j[*-Sic*®dr(«) 

defines a f-process with dispersion matrix tU. There is therefore a ^process 
with variables {((0} such that 

(4.3.20) Ui(t) ^ Sie’Uiia). 

This equation can be solved for f(l) and xit): 

(43.21) x{t) - #(«) + e‘*‘x(0) 
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Wfaete iSi « £^^^. The matrix iS^C/oaiibe writtnikt thepoUr fona <80 S 
is symmetric and non-negatiTe definite and 0 is cnthagoital. S k' the iS 
of (4.3.2) etc. 

The remainder of the proof follows closely the proof of Theorem 3.B and wffl 
be omitted. 

An important class of t.h.G.M. processes which arises frequently in physical 
applications is obtained in the following way. Let {£(0) be Ibe variables a 
one-dimensional ^process. Consider the formal equation 


(4.4.1) 


dt" ‘ 


- a^yit) - c€'(f), 


where Oi, ■ ■ • , ajr, c are constants. This equation cannot be contidered precise 
as it stands, since ^(f) does not exist. The problem can however be reformulated 
as follows: find a y(t) process, where j/', • • • , are supposed to e»st, satis-i 
fying the equation 


(4.4.2) 


[mdy^'^-^\t) - at fj(t) dy^’'^\t) - •• 

- a;, fjim) dt^cf^ m diit) 


with probability 1, for each continuous function /(f) and each pair of numbers 
a, b. The formal integrals are defined as the limit in the mean of the usual 
sums.*® The integral on the right has already been discussed. With this 
interpretation, equations involving can be treated in the usual way, and this 
will be done in the following without further comment. The formal solution 
of (4.4.2) is well known. Let Xi, • • • , Xjy be the roots of the equation 

(4.4.3) - • • • - Oat = 0 

and suppose that these roots are distinct, and have n^ative real parts. Let 
Ajh be the cofactor of X*~‘ in the determinant 


(4.4.4) S 


1 

• 

1 

Xi 

X2 

• Xjv 

xr* 

A2 

• xr 


Then the general solution of (4.4.1), that is to say of (4.4.2), is 

(4.4.6) y(t) d{(») + i E A*ye"'‘y‘*-«(0). 

Since the integrand and its first AT — 1 derivatives vanish when « » y', • • • , 

as defined by (4.4.5) exist, but does not exist, because {'(0 in (4.4.1) 


** For a full discussion in the case JV 1 cf. Doob, Annals of Maih,^ Vol. 48 (1942), pp.' 
858-^1. 
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does not exist. The y{t) process is a t.h.G. process if y(0), • • • ,y^’' ^’(0) are 
chosen propedy. This can be seen from the solution 

(4.4.6) y{t) i d{(s). 

^ 0 J^90 /-I 

In ffW5t this is the only solution defining a t.h.G. process. To prove this, rewrite 
(4.4.5) in the form 

(4.4.60 y(0 = 5 E A«y f‘ df(s) + i E A*,e"''‘-"’ y'*-”(r). 

0 y-i Jf-T o i,i-t 

If the y(t) process is a t.h.G. process, (4.4.6') becomes (4.4.6) when 
Thus there is a unique stationary solution to (4.4.1) and, by (4.4.6), every solu¬ 
tion tends to this solution in the long run. The statioirary solution (4.4.6) 
has the property that y(t) is written in terms of {(s) for s ^ t. Then in (4.4.6) 
the integral is independent of the terms involving the initial conditions. In 
other words 

(4.4.7) E{y(s), s g 0; 2/(0} = i E A*,c"'‘2/“‘”(0). 

O uM 

Hence the variables y(i), y'{t)y • • • , define an AT-dimensional t.h.G.M. 

process. The transition matrix function ^4(0, and the matrices Q, 5, T of 
Theorem 4.3 are easily calculated. 

^(0:(A*yXr‘e""), 

1 0 0 
0 1 

(4.4.8) Q:' 


r = 0. 

The necessary changes to be made if the Xy are not distinct are well known. 
The case c = 0 will be treated below, when the problem will be reconsidered 
from another point of view. In all cases the solution of (4.4.1) leads to an 
i^-dimensional t.h.G.M. process.^^ 

As a simple example, consider a torsion pendulum, suspended in a sealed 
container. The only turning forces acting on the pendulum are the molecular 
shocks of the surrounding gas, and the restoring torqtie. The equation of mo¬ 
tion is 

(4.4.9) + a,y(() = X((), 

According to a letter from Uhlenbeck, the differential equation (4.4.1) was solved; 
from a somewhat different point of view, by Miss Ming Chen Wang, in a thesis written in 
1941 which is unfortunately inaccessible to me at the moment. 
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wlwre y is the angular displaoetuient measured from the equilibrium po^OO, 
/ is tile moment of inertia, on is the torque coefiELdent of the suspendon, and tim 
molecular force is resolved into a systematic Stokes term aip' and a reminder X. 
The remainder term X(t) defines a stationary process which to a first approrima- 
tion is “purely random.” In the present context “purdy random” means that S 
*!<•••<<,, X(ti), ■ • • , X{t,) are mutually independent. This is {Hecisely 
the property the derived process of a ^process would have, if f'(0 eidsted. 
Unfortunately it has already been noted that {'(f) does not exist, since tlm dif¬ 
ference quotient ({(f + h) — {(<)]A is unbounded as h —> 0. It has already 
been seen, however, that (4.4.9) can be given a meaning with X(f) identified 
with c{'(f) even though {'(f) does not exist, and it has been seen that the solution 
approaches a steady state. It may still be a disappointment to some that the 
solution y{t) has a first derivative j/'(f) but that y"(t) does not exist: there is an 
angular velocity but not an angular acceleration! This unhappy circumstance 
can either be blamed on the physical world, or on the mathematical approxima¬ 
tion to the physical world, depending on the point of view. The corresponding 
electrical picture is the following. There are spontaneous currents in any 
electrical circuit, due to the thermal motion of the electrons. This is known as 
the Johnson effect. In a simple closed circuit, consisting of an inductance L, 
a resistance, R, and a capacitance C in series, the current equation can be written 
in the form 


(4.4.10) 



, jf dy{t) y{t) 


= X(f), 


where y is the charge on the condenser and X(0 represents a fictitious voltage 
set up by the motion of the electrons. The X(f) is identified with cf'(0 as 

du 

before. In this case there is a current , but the current function has no 

at 

derivative. In these applications, the physical justification for the Gaussian 
character of the ^distribution lies in the Gaussian character of the Maxwell 
distribution of elementary particle velocities. The known mean particle kinetic 
energy determines the constant c in (4.4.1). The more complicated mechanical 
or electrical systems will lead to equations of higher order than 2, or systems of 
equations. For example the usual current equations of a net or resistances 
capacitances and inductances lead to a system of say v second order equations 
of type (4.4.10), and the corresponding pairs y, y' form a 2 v-dimensional t.h.G.M. 
process.** 

The processes defined by linear differential equations of tl^e type (4.4.1) ace 
the analogues of the t.h.G.Mi(r. processes in the discrete parameter case. Instead 
of defining these solutions of (4.4.1) as the t.h.G.Mjv. processes, however, we 
shall use a definition closer to the definition in the discrete parameter case. A 


Further diecuimion and references to papers by physicists on this subject will be found 
in Doob, Annals of Math., Vol. 43 (1942), pp, 351-69. 
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one-dimensional t.h.G. process with variables |j/(0} will be called a t.h.G.Mw. 
process if the derivatives y'it), • • • , exist, and if whenever s < t, 

(4.5.1) J5|y(r), r ^ s; j/(0} = E\y{s), y'(s), ■ ■ ■ , yit)]. 

UN = 1, the process is a t.h.G.M. process. The right hand side of (4.5.1) 
is a linear combination of the variables y(s), • • • , The variables 

{{/(<)) thus satisfy an equation of the form 

(4.5.2) y(t) - Oi(< - s)y(s) — ... — ow(< - s)y^"~'\s) - yis, t) 

where y(8, t) is independent of the variables {j/(t)} for t S s. Define the vari¬ 
ables {x(01 of an A/'-dimensional t.h.G. process by 


(4.5.3) 


a:i(0 = y(t) 

i = 1, ••• ,iv-1. 


If this process is degenerate, there is a relation of the form 

(4.5.4) co 3 /(«) + ciy'is) -f • • • -|- Cw_i j/'^~‘(s) =0, S I cy | > 0. 

)-0 

It can be assumed that i 5 ^ 0, (differentiating (4.5.4) to get a term in 
if there is none originally). Then can be eliminated in (4.5.2), to get 

a relation of the same type with N replaced by iV — 1. Hence the process is 
non-degenerate if N is the minimum index for which (4.5.1) is true. It will now 
be proved that the x{t) process is a t.h.G.M. process. It can be assumed to be 
non-degenerate. Using (4.5.1), 


(4.5.5) £{x(r), r g 8]Xi{t)] = B{2/(t), r ^ a; y{t)} = E{x{8)\Xi(t)} , 

It must also be shown that 

(4.5.6) E{x{t), r g /?; Xj(f)] = E{x{s); XjH)] j = 2, • • • , AT. 

This will be shown by justifying the taking of derivatives in (4.5.5). It will be 
sufficient to prove (4.5.6) when j = 2. Using (4.5.1), 

(4.5.7) eL(t), r g s; = *'{a:(8); ~ 


The right hand side is a linear combination of Xi{s), • • • , Xsis) whose coefficients 
are continuous in h, fe ^ 0, since the correlation function of the y(t) process is 
continuous. Hence the right hand side converges to 


Eixis); y\t)\ = E{x{8)) x^it)] 
when h —► 0. Since the difference 


X{t) - £{*(«); 
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is uncorreiated with x(r) if r g s, the same is true of its limit as h —»0. Ihis 
means that (4.5.6) is true when j = 2, as was to be shown. Conversely if {y(t)} 
are the variables of a one-dimensional t.h.G. process, if y'(<), • • • , 
exist, and if the x(t) process defined by (4.5.3) is a t.h.G.M. process, the y{i) 
process is obviously a t.h.G.Mjf. process. The transition matrix function A(<) 
and the matrices Q, S, T of Theorem 4.3 are easily calculated. Suppose that 
the x(0 process is non-degenerate. Since y^'~^\t) is given by 


(4.5.8) 


y»-"(t) = Xi(t) = /' i: [c<‘-»*S)<ydfy(«) + L 
- L.«o ^-1 y -1 

= /‘ E [«“■*>''-Sl«d€y(«) + t (c‘«)yyXy(0) 

Jq i-i ^-1 


and since exists if i < N, it follows that the integrand must vanish when 
8 — t: 

(4.5.9) (S)o* - 0, i = 1, .. • , iV - 1, j = 1, •. ‘ , JV. 

Since S is symmetric and non-negative definite, S must have the form 




'0 • 

• • o' 


(4.5.10) 

S: 

0 • 

• • 0 1 ’ 

c ^ 0. 



.0 • 

• 0 c 


The fact that x<(0 = 

= means that 






N 


(4.5.11) 

E (e“'Q).y:cy(0) = 
y-i 

S (e"*)i+iyxy(0), 



or, since the x{t) process is non-degenerate, 
(4.5.12) ^ 


- (o.« 


Hence (< —» 0) Q has the form 

/ 0 10 • • 0 


(4.5.13) 

n 'o ' 

•01 


\ciff • 

• • • ffli 


Conversely if there is an iV-dimensional non-degenerate t.h.G.M. process with 
transition matrix fimction where Q is given by (4.5.13) and dispersion matrix 
S given by (4.5.10), 

i<(<) = *y+i(0. i = 1, • • • , iV - 1, 

and the Xi{t) process is a t.h.G.My. process. 
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Case 1. (S = 0 (deteministie case). In this case the x(t) process is deter¬ 
ministic: 

(4.5.14) xiD = 

Since Q satisfies its characteristic equation 

(4.5.15) * — • • • — Ojv = 0 
it follows tliat 

(4.5.16) x^"\t) - aix‘"-"(0-Ovx(t) = 0, 

(4.5.17) - a^t) = 0. 

The roots of (4.5.15) are simple roots, and are all pure imaginary, according to 
Theorem 4.3. It follows that 

(4.5.18) y(t) = 21 (vi cos Wj + f,- sin tB,) 

} 

where the and are one-dimensi6nal Gaussian variables, and [idj] are the 
distinct roots of (4.5.15). 

Case 2 . S 9 ^ Q {ncmrdeterininislic case). In this case it will now be shown 
that the x{t) process has no deterministic factor, that is that the roots of (4.5.15) 
all have negative real parts. In fact let be a root of (4.5.15), corresponding 
to the characteristic vector z oi Q*: 

(4.5.19) 3 = ■ ■ ■ , as + as-ip + • • • + ai^*'"‘) 

= (•••,/3''). 


Then using (4,3.5) 

0 < c* I /3 = (5-3, z) = -(B(0)Q*3, z) - {QR{0)z, z) 

(4.5.20) = - X(«{0)2, 2) - X(«(0)2, 2) 

= -(X -I- X)(ft(0)2, 2). 

Hence \ + \ is real and negative: X has a negative real part. In this non- 
deterministic case, therefore, the x{t) process can have no deterministic factor. 
The matrix T is the null matrix, and (4.3.2) becomes 

(4.5.21) x«) = j[‘5 d|(s) 
which leads to 


( 4 . 5 . 22 ) 


yit) - c£[e«->Vd«*.(s). 
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Moreover 


(4.5.23) 




!/'(«) - c f dU>) 

•^"00 


Since Q satisfies its characteristic equation (4.5.15), 

, (4.5.24) f ‘ -iaif dfv(«) =0. 

JLoo /-il 

In other words 

(4.5.25) c dfjir(a) — oij/‘""”(0 - ••• - ajty(t) = 0. 

Now formally, if 1^(0 existed, the last equation in (4.5.23) could be differentiated 
to give 

(4.5.26) y‘^>(<) = c[Q^-‘]ii,f;«) + c £ d£,(8) 

and (4.5.25) would become 

(4.5.27) ~ ‘ - a^yit) = c^s{t). 

(We are using the fact that =1.) Thus the t.h.G.Mjy. processes satisfy 

the formal differential equation (4.5.27) already discussed above from another 
point of view. Equation (4.4.2) is readily justified. 

Theorem 4,6. (i) Let {a;(0} be the variables determining a tJi,GM, process. 

Then considered as Junctions of the x{t) are continuous with probability L Let 
[yif)] = {Xr(t)} be the variables of a coordinate process. 

(ii) If y*(t) exists, it is a linear combination of coordinate functions: 

N 

J/'(0 = 2 c/a:,(<). 

J-1 

(iii) If y'{t), • • • , y^^'~^\t) exist, y{t) satisfies a generalized differential equation 
(4.4.1), that is the y(t) process is a t.h.G.Mif. process. 

(iv) If y'{t), • • •, y^^\t) exist, y{t) has derivatives of all orders. The y{t) process 
is a t.h.G.Ms. process {deterministic case) and y{t) therefore satisfies an Nth order 
homogeneous differential equation (4,5.17). 

(v) If xi{t), • • • , Xx{t) e^xist, that is ifx'{t) exists, the x{t) process is deterministic 
and the coordinate functions have derivatives of all orders. 

Proof of (i). It has already been shown that the {r(^)} determined by (4.3.14) 
determine a f-process, and the dispersion matrix function D{t) of the f-process, 
given by (4.3.16), is certainly continuous. Hence, by Theorem 4.1, the {r(OI> 
and therefore the {a:(0} a-r© continuous in t, with probability 1. 
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Proo/ of (tt). If Xr{f) exists, the rth row of S in (4.3.2) must vanish, and 2,(0 
is given by the rth coordinate of 


(4.6.1) 
Henoe 

(4.6.2) 


f Qe“->®Sd{(s) + (2e‘*x(0) = Qx{t). 

•L-oo 


4(0 * Z) (Q)Tjxj{t). 


Proof of (m). Suppose that Xr(t), • * •, ^\t) exist. Then (r is fixed in 

the following equations) 

xr(t) = [‘ z s],yd{,(«) + E nxm, 

J9 j -1 i-1 


(4.6.3) 


4(0 = W‘~‘''‘QS]ridUs) + E [e‘®Q]ryx,(0), 

- r Z [e“->«Q''-‘S],yd{,(«) + Z [e“'Q''-‘]2y(0) 


and (in order that the derivatives can exist) 
(4.6.4) 


Sri = 0 

iQS)ri = 0 


1 , 


AT. 


iQ’""S)ri = 0, 

Since Q satisfies its characteristic equation, say (4.5.15), 

(4.6.5) f‘ fe'‘-’«Q'".S]df(s) - Z a, f d{(8) = 0. 

7—1 J-oo 

This vector equation can be written (using only the rth coordinate) in the form 

(4.6.6) r z le'‘“*'‘'Q*'S]rydfy(8) - a,4"~”(0 - • • ■ - ay 2,(0 - 0. 

tL.«> fm.1 

If ('(t) existed, the last equation in (4.6.3) could be differentiated to give 

(4.6.7) 2‘'^>(0 = /“ Z [e“-‘>‘'Q'^S]ryd{y(s) + Z lQ^“‘S]ryf;(0 
and (4.6.7) would then become 


(4.6.8) 4''’(0 - ai2'"-‘>(0- 

Now the process with variables 

(4.6.9) Z (Q""^«)rife(0 

(C y.i 


ay 2,(0 = E (Q’'~'S)ri('(t). 
y-1 







OAUS«AK PSPCXSBES tff 

ifi a f-prooess, if e is chosen properly, unless the parenthesis in (4.6.9) viniislies 
for all j. In either case (iii) is proved. 

Proof of (tv). If in (iii), exists, (4.6.3) can be augmented to include 

(4.6.30 xj^>(() = r Z [e“-*>*Q'"51r/d{X8) + f [c‘“Q%(0) 

j-1 

and (4.6.4) now includes 

(4.6.40 (Q^'~'S)rf ^ 0, i=l, ...,Ar. 

In this case the last term in (4.6.7) vanishes and (4.6.8), with zero on the right 
hand side, is strictly true. 

Proof of (v). If x[{t), • • , x^Q) exist, S must vanish and (4.6.3) yields 

(4.6.10) x{t) = e'^x(0), x^’'\t) = Q^xH), ^ 

Thus the xit) process is deterministic and x(t) has derivatives of all orders. 

Theorem 4.7. Let |x(0} he the variables of a oiie-dimensional Lh,G. process. 
The process is a component process of an N-dimensional Lh,G,M, process if and 
only if the chance variables 

(4.7.1) .t(0), {E{xi8), 8 ^ 0; x(t)]} 0 < t < oo 

are linearly dependent on N variables. 

Suppose that the x(t) process is a component process of an i^-dimensional 
t.h.G.M. y(t) process: x(t) = yrit), and let A(t) be the transition matrix function 
of the y{t) process. Then if € > 0 and if n is any integer, the difference 

y[(n ])c] - A(€)y{n€) 

is independent of every y{s) with s ^ rn, and therefore independent of every 
yime) with m ^ n. Hence the yine) process is a t.h.G.M. process (discrete 
parameter case). Ecpiation (3.8.5) becomes, in this case, if n = 0, 

(4.7.2) xliN + ph] - a{"^xl{N - 1)«] - • • ■ - or’a;(0) = rii'\N + y) 

where rii’^iN + p) is not merely independent of the variables • • • , x(—*), x(0), 
but is even independent of every x(s) with s ^ 0. It then follows, applying the 
operator J57{x(«), a* ^ 0; -} to both sides of (4.7.2), that the variables in (4.7.1) 
are linearly dependent on N variables if t is restricted to be a multiple of €. 
Allowing € to run through the values 



it follows that the statement of the theorem is true if ^ is restricted to be rational. 
The proof will be complete when it is shown that the subject** variables for ra¬ 
tional t are dense in the whole class in the sense that for any the expectatii!m 

(4.7.3) 5 = E{[E{x{8), s S O; x(0) — E[x{8)j s g 0; x(0}f I 


** Courtesy of U. S. Navy. 
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conva^es to 0 when 2' -* L In fact, using the Schwarz inequality 

^ ^ E(IE{x(8), 8^0; x(t') - a:(0}]*} 

(4.7.4) - . 

g E{E{x( 8), 8 g 0; [x(n - x(e)r}} - E{lx(n - x(t)f} 

and the basic continuity h 3 rpothesis (1.3.1) imposed on continuous processes is 
precisely that tJie last expectation converges to 0 when t' t. 

Conversely suppose that the chance variables (4.7.1) are linearly dependent 
on N variables. It can be supposed that x(0) is one of these N, I^et the others 
be those for which ^ = < 2 , • * * , , and define j/iit), • • • , y^iO by 


(4.7.5) 


yi(t) = x(t) 

yi{t) = E{x(8), s g t; x(t + <>) ; = 2, - • , iV. 


The y{t) process is obviously an iV-dimensional t.h.G. process. Moreover 
E{y{8), 8^0; yj(t) = Elxia), s ^ 0; yj{t)} 

(4.7.6) 

=* i^{x(s), 8^0; x{t + ti)] j = 1 , ... , 

(where t\ is defined as 0). Since the right side is by hypothesis, for each j, a 
linear combination of yi(0), • • • , 2/y(0), the y{t) process is a t.h.G.M. process, 
and the x{t) process is a component process, as was to be shown. 

A detailed examination will now be made of t.h.G.Miir. processes, and of the 
more general dhss of component processes of t.h.G.M. processes. The following 
theorem will be useful. 

Theorem 4.8. Let {x(0 } be the variablea determining a t.h,G, continuous paramr 
eter process. The process is a component process of an N-dimensional t.h.G.M. 
process if and only if for each €> 0 the discrete parameter process with variables 
{x(ne)} is a component process of an N-dimensional t.h.G.M. process. 

If the x{t) process is a component process of an i\r-dimensional t.h.G.M. y{t) 
process, the x(n€) process is a component process of the iV^-dimensional t.h.G.M. 
y{ne) process. Conversely suppose that the xine) process is a component process 
of an iV-dimensional t.h.G.M. process (which may depend on e) for every c > 0. 
It follows that for each c > 0 the chance variables 


(4.8.1) £{•••, x(-e), a:(0);a;(n e )|, n = 0, 1, • • • 

are linearly dependent on N of their number. Hence the same is true of the 
following chance variables, if !», m are fixed and v > m: 

(4.8.2) E{ ••• , x(0); x(n/m} n = 0, 1, • • • . 

According to the Corollary to Theorem 1.2, when V —»<» the conditional expectar 
tions in (4.8.2) converge to 

E[x{8), s ^ 0, 8 rational; x(n/ffi)} n = 0,1, • • • . 


(4.85) 
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Hence the chance variables (< rational) 

(4.8.4) iE{x(8), s g ,0, s rational; x(0} = E{x(8)^ 8 g 0; a;(0}*^, 

0 <t < 00 , 

are linearly dependent on N of their number. As in the proof of Theorem 4,7 
it follows that the same is true if t runs through all positive real numbers, and 
according to Theorem 4.7, the x(t) process is therefore a component process of 
an iV-dimensional t.h.G.M. process. 

Theorem 4.9. Let {a;{0) be the variables of a one-^imen8i<mal cantinvms 
parameter t,h,G. process. The process is a component process of a Jmite-dimefh 
sional t,h.G.M, process if and only if the complex spectral function of the process 
is the sum of the infegral of the square of the absolute value of a rational function of 
X and of a rn^otone non-decreasing function increasing only in a finite number of 
jumps,Specifically; 

(i) The process is a component process of an N-dimensional t,h,GM. process if 
and only if the complex spectral function has the form 


(4.9.1) 


e(x) - £ 


I do(tX)^ + •' • + djy-i I 

{i\r + + ... 


d\ + 6(\) 


where 

(a) Q(\) is a monotone n^ynndecreasiiig function satisfying (1.3.3) and increasing 
only in jumps^ at no more than N points. 

(b) the denominator of the integrand vanishes at every discontinuity of fl^(X), and 
the numerator vanishes at every zero of the denominator^ to at least the same order; 

(c) the coefficients in the integrand are real, and the roots of the X polynomials are 
all on the real axis or in the upper half plane. 

The integral vanishes identically if and only if the x(n) proems is a component 
process of an N-dimensional deterministic process, and G(\) vanishes identically if 
and only if the variables {x(0} vanish identically or the x{t) process is a component 
process of an N-dimensional t,h.G,M. process with no deterministic factor, 

(ii) The process is a t.h.G.M k. process, in the deterministic case, if and only if 
the complex spectral function G(\) = G{\) is a function increoMng only in jumps, 
at 'no more than N points; non-deterministic case if and only if the complex spectral 
function has the form 


(4.9.2) 



cdK 

{iky + ... + ajvl 


* The equality (4.8.4) is proved as follows. Let t be fixed, and let x be the chance vari¬ 
able on the left. Then x{t) — x has mean 0 and is uncorrelated with every a;(s) with 8 0 and 

rational. It follows at once from the continuity of hypothesis (1.3.1) that then x(t) — «is 
uncorrelated with every x(8) with a ^ 0: it follows that (4.8.4) is true. 

^ It is easily seen t^t the first term of the two can also be described simply as the in¬ 
tegral of a rational function of X, which is nop-negative for real X and is integrable and an 
even function, like all complex spectral density functions. 
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Proc{/ cjf (t). Suppose that the x(i) process is a one-dimeosional compoaent 
process of an iiT-dimensional t.h.G.M. p(t) process, x(t) = yt(t). It is no restric¬ 
tion to assume that the y(t) process is non-singular. Then the correlation func¬ 
tion of the y(t) process is given by 

(4.95) 


i ^ 0 
t g 0, 


where Q is uniquely determined and 


0(X) - 0(0) 


(4.9.4) 




L 


• MX 


- 1 


ti 


Ry(t) dt 


I 


-f 

2x1. 


- 1 
it 


[/2y(f)]u dt, 


at the points of continuity of G(X). 

The correlation function Ry(t) has derivatives of all orders for ^ > 0: 

t > 0 

(4.9.5) 

= t < 0. 

Suppose first that the y{t) process has no deterministic factor, in other words 
that it is non-degenerate and of type M. Then the characteristic values of Q 
have negative real parts and R(t) 0 exponentially when | ^ 1 «>. Hence 

0{\) has a continuous derivative (j'(X): 


(4.».e) o'W-sZ] 

Integrating by parts, 

0'(X) = 

f2;(o-f) - fl;(o-) 




[Ay(()]ii dt. 


(4.9.7) 


2»(t\)* 

H -fl 

2x(»\)* 


+ 


•0 

e 






= fi^(0+) - fiy(O-) _ g»(0+) - fiv(O-) _ J_ r i' 


2x(tX)‘ 


2xJL«(tX)» 


K(t), 


Since Q satisfies its characteristic equation 
(4,9.8) — • • • — fljv = 0, 

it follows that 

Ri^\t) - aiRt’'-^\t) - asBy(t) ^0 t>0 

- • • • + (-!)''■*«,(<) “0 f < 0 
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and therefore if 17 is the operator ~ 

ut 


(4.9.10) 


[U*' - aiV”-^ - anl/HU’' + oiU 


■M-i 


+ tpto. 

Applying (4.9.10) to (4.9.6) 

[(*X)*^ — oi(tX)*^ * — • • • — Ow][(»X)^ + Oi(»X)^~^ + • • • 

(4.9.11) + (-l)^-'o^]<?'(X) 


= 1 (tX)'^ - Oi(tX)''-‘-oy 1*<?'(\) = P(»X) 

where P(tX) is a polynomial of degree 2n — 2. Since P(tX) is real and non- 
native, when X is real, the roots on the real axis are of even multiplicity and 
those oS the axis are symmetric in the axis. Moreover P(tX) is even, since the 
left side of (4.9.11) is even. It follows easily that P(tX) can be written in the form 


(4.9.12) 


P(iX) 


w-l 




where the roots of the polynomial are ail on or to the left of the imaginary axis. 
Finally 


(4.9.13) 


G'(X) 


I /3b(fX)^ ^ I* 

I (iX)^ — • • • — I* 


The denominator polynomial in X vanishes only at points where i\ has a negar 
tive real part, that is where X has a positive imaginary part. This completes 
the proof in the case where the iV-dimensional y{t) process has no deterministic 
factor. If there are such factors, it is easily verified that (7(X) has corresponding 
discontinuities and the above proof then applies to Cr(X) less its jump function. 
The result can finally be summarized as in the statement of the theorem. If the 
y{t) process has only deterministic factors [i?y(<)lii will be a sum of trigonometric 
functions and G{\) will be a function of jumps. 

Conversely suppose that the x{t) process has the complex spectral function 

(4.9.13) . Then following the ideas of the proof of the analogous section pf 
Theorem 3.9, it follows that R(t) satisfies the differential equation (cf. (3.9.10) 
and (3.9.11)). 

(4.9.14) P'^'(<) - a,P‘"~“(0 - . •. - a„m) = 0,t> ,0. 

Any solution of (4.9.14) is a linear combination of (at mast N) functions 
(4.9.16) te"*, • • • 


where is a root of the equation 

(4.9.16) a’' - aia"~^ - • • • - a# = 0 

and where powers of t may appear if is a multiple root. Let € be a positive 
number. The discrete parameter process determined by the variables {a;(n<)) 
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has correlation function R{n€). This function is a linear combination of func¬ 
tion 

(4.9.160 n(s^)^ •• • 

corresponding to those of (4.9.15). There is an equation 

(4.9.17) - ai(c)a^’'-- - aj,(€) « 0 

with the {e^*} as roots, of the same multiplicity as that of jS in (4.9.16). Hence 

(4.9.18) R[{n + N)€] - ai(€)5:[(n + iV - 1)€] - • • • - a^(e)B(n6) « 0, 

n ^ 0. 

According to Theorem 3.10 the xine) discrete parameter process is therefore 
a component process of an i\r-dimensional discrete parameter t.h.G.M. process. 
Since this is true for all €, the x{t) process is a component process of an AT-dimen- 
sional continuous parameter t.h.G.M. process. 

If the integral vanishes identically, the non-deterministic factors in the N- 
dimensional process are irrelevant to the x{t) process and can be replaced by 
factors of type M(0). If on the other hand the spectral function is continuous, 
the deterministic factors are irrelevant and can be replaced by factors of 
type M, 

Proof of (m). Since the t.h.G.Mjv. processes are characterized among the 
component processes of JV-dimensional t.h.G.M. processes by the fact that the 
first iV — 1 derived process exist, their spectral functions (according to Theorem 
1.4) are characterized by the property that 



that is the numerator in (4.9.1) must be identically constant. If this constant 
is not 0, S'(X) can have no jumps, since each jump corresponds to a zero of nu¬ 
merator and denominator. Hence G(X) is either identically S'(X) or is in the 
form (4.9.2). The two possibilities obviously correspond to the deterministic 
and non-deterministic cases, respectively. 

CoKOLLARY. The IM.GMn- one dimensional 'process which is the solution of 
(4.4.1) has complex spectral function 


(4.9.19) 



c'dX 

ai(fX)^-> - 


— ai 


In fact the complex spectral function has the form (4.9.2), where the coeflBi- 
cients in the polynomial are those of the differential equation for the correlation 
function Ryif) in (4.9.9), that is the coefficients of the characteristic equation of 
the infinitesimal transition matrix Q, (cf. (4.4.8)). The evaluation (4.9.19) is 
also easily proved directly. 

The analogues of Theorems 3.10 and 3.11 in the continuous parameter case 
are easy to prove and will be omitted. 




ON CUMULATIVE SUMS OF RANDOM VARIABLES 

Bt Abbaham Wabd 
Cdumbia Univeraity 

1. Introduction. Let {c<) (i 1,2, • • •, ad inf.) be a sequence of independmt 
random variables each having the same distribution. Denote by Zf the sum 
of the first j elements of the sequence [Zt], i.e., 

(1) Zj=^ zi + zt+ -f- O' “ 1. 2, • • •, ad hrf.). 

Let a be a given positive constant and b a given negative constant. D^ote 
by n the smallest positive integer for which Zn lies outside the open interval (b, 
o), i.e., Zn is either < b or > a. Obviously n is a random variable. If b < Z< < 
a for i = 1,2, • • • , ad inf., we shall say that n = «. 

For any relation R we shall denote the probability that R holds by P(R). 
It will be shown later that P(n = ») = 0, provided the variance of cj is positiv©. 

In this paper we shall deal with the problem of obtaining the value of P(Zn > 
o)‘ and that of finding the probability distribution of n. 

The study of such cumulative sums is of interest in various statistical prob¬ 
lems. For example, a multiple sampling scheme proposed recently by Waltar 
Bartky’ makes use of such cumulative sums. 

Cumulative sums also play an important role in the theory of the random 
walk of interest in physics. The results obtained in this paper may have bear¬ 
ing particularly on the theory of the random walk with absorbing barriers. In 
the presence of an absorbing wall the random walk stops whenever the particle 
arrives at the wall, i.e., whenever the cumulative sum of the displacements 
reaches a certain value.* 

2. Two Lemmas. Lemma 1. If the variance of is not zero, P{n = «) = 0. 

Proof: Let c = |a|-|-|b|. Ifn=oo then for any positive integer r the 

following inequalities must hold 

( (t+l)r \J 

2 <) < c* (fc = 0,1, 2, • • •, ;ad inf.). 

«^r+l / 

To prove P{n = oo) = 0, it is sufficient to show that the probability is zero that 
(2) holds for all integer values of A;. Since the variance of Zi is not zero, the ex- 

1 Since P(n = oe) » 0, we have P{Zn ^ 5) » 1 - P{Zn ^ a). 

* “Multiple sampling with constant probability”, Annals of Math, SiaLj Vol. 14 (1943), 
pp. 363-377. 

’ See in this connection S. Chandrasekhar, “Stochastic problems in physics and astron* 
omy”, Rev, of Modern Physics^ Vol, 15 (1943), p. 6. 
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tij conveiges to «> ae j —> ao. Hence th^ exists a podtive 
integer r such that 

Freon (3) it follows that the probability that (2) is fulfilled for aQ values of is 
equal to zero. Hence P(n = «) = 0 and Lemma 1 is proved. 

Lemma 2. Let z be a random variable ouch that the foUowirig four eonditiona are 
fidfiUed: 

Condition I. Both the expected value Ez of z and the variance of z exist and are 
unequal to zero. 

Condition II. There existe a positive 5 sttch that P(e* < 1 — 5) > 0 and P(e* > 

1 + «) > 0 . 

Condition III. For any real value h the expected value = g(h) exists. 
Condition IV. The first two derivatives of the function g(h) exist and may be 
obtained by differentiation under the integral sign, i.e., 

g'(h) - 4 
an 


pected value 


-(§ 


and 


g'\h) = ^^E^ = Eie"'. 
cur 

Then there exists one and only one real value ^ ^ 0 such that 

Pe**' = 1. 

Proof: For any positive h we have 

(4) g{h) > P{e‘ > 1 + «)(1 + «)\ 
Hence, since P{e‘ > 1 + 5) > 0, 

(5) ' limflr(A) * +». 

Similarly we see that for any negative h 

(6) g{h) > P{e‘ < I - S)0. - S)\ 
Hence, since P{e* < 1 — 8) > 0, we have 

(7) Ihn g{h) = +». 

h m ~ m 

Sinnft g"Qi) = FzV* it follows easily from Condition II that 

(8) > 0, 

for all real values of h. 



TTie rdatioQs (6), (7) and (8) imjdy there exjsts exae% mss.tealrvphw 
for Which g(h) takes its rainimum value. Since Ezia unequal td fero 

by Cundition I, we see that h* 0 and g(h*) <, jr(0) *= 1. It is clear that t^ 
function g(h) is monotonically decreasing in the strict seuM over the interval 
(— <», h*), and is manotonicaOy increasing in the strict sense over the intwval 
(h*, + <»)• Since {^(0) = 1 and g(h*) < 1, there exists exactly <Hie real value 
fto 0 such that g(Ih) = 1. Hence Lemma 2 is proved. 

3. A fundamental identity. Denote by 2 a random variable whose distribu¬ 
tion is equal to the common distribution of Zi(i = 1, 2, • • • , ad inf.). Let O’ 
be the subset of the complex plane such that Ee‘* — <p{f) erists and is finite for 
any point t in D’. Consider the following identity 

(9) ^ ^ 

where N denotes a positive integer. Let Pk be the probability that n ^ N. 
For any random variable u denote by Eniv) the conditional expected vidue d 
u under the restriction that n < N, and by Fj(a) the conditional e:q)eoted 
value of u under the restriction that n> N. Then identity (9) can be written as 

(10) + (1 - Pj»)Fve'*" = [^(Or. 

Since in the subpopulation defined by any fixed n < JV the expression Zh — 
Zn is independent of Zn, we have 

( 11 ) = £^••‘[^( 01 *'"". 

From (10) and (11) we obtain the identity 

(12) + (1 - P^)Ete*”‘ = [v>«)r. 

Dividing both sides by [¥>(0]*^ we obtain 

(13) PyEAe’'%it)r\ + (1 - Pi.) - 1. 

Let D" be the subset of the complex jdane in which'] ip(t) | > 1 and denote by 
D the common part of the subsets D' and D*'. Since lim (1 — P^) =‘ 0, 

and since | Etie*"*) \ is a bounded function of N, we have in D 

BF* 

Since 

lim P^F^{e'«'[,,(0r"} = P{e^-‘[^(0-“}, 

■ Norn 

we obtain from (13) and (14) the fundamental identity 

(16) F{e'-V(0r"} » 1, 

for any pmnt t in the set D. 
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4. DMivatiim of fte probability that Z„ > a. In what follows in this and the 
sobseciu^t secticHis we shall dways assume that the random variable z satisfies 
the conditions I-tV of Lemma 2, even if this is not stated e]q)licitiy. Since it 
follows from Condition III that the set D’ is the whole complex plane, we see 
that the identity (15) must hold for all points t for which | (pit) | > 1. 

Let Ao 0 be the real value for which ipQio) =“ 1. Substituting ho for t in (16) 
we obtain 


(16) 


= 1 . 


Let El be the conditional expected value of e*"'** under the restriction that 
Zn> a and let Eo be the conditional expected value of e*"** under the restriction 
that Zn < b. Furthermore denote P(Zn > a) by a. Then it follows from (16) 

(17) aEi + (1 — a)Eo = 1. 


Hence 

(18) 


1 - Eo 


Ei-Eo' 

If Ao > 0 then Ei > 1 and Eo < 1. Hence (18) implies the inequality 


(19) (Ao>0). 
If Ao < 0 then < 1 and J?o > 1. Hence (18) implies the inequality 

(20) 1_„<1<4-, (A. <0). 


We shall now derive lower and upper limits for and Ei . We derive these 
limits under the assumption that ho > 0, To obtain a lower limit of Eo consider 
a real variable f which is restricted to values > 1. For any random variable u 
and any relation R we shall denote by E{u\ R) the conditional expected value 
of u under the restriction that R holds. Denote by P(f) the probability that 

have 


(21) Eo = 1 

Hence a lower bound of Eo is given by 

( 22 ) 


Eo = c“* |g.Lb. (E (c*** I c**' < , 


where the symbol g.l.b. stands for greatest lower bound with respect Co j*. Since 
t 

is an upper bound of Eo, we obtain the limits 


(23) 


e“*|g.l.b. I «*** < 01 ^Eo< 


(ho > 0). 



Ctni|7I<4XIV|E SUMS 




Let /> be a real vaciaUe teetriicted to yaluee > 0 and < !• Denote 
'Uie probability that c****“‘ < Then similadiy to (21) we obtidn 


(24) 


(«*•* I «*•* > 0}‘W(p). 


Hence an upper bound of Ei is given by 


(26) 




|l.u.b.p£(e*-! «*•* > 01. 


Since e***® is a lower bound of Ei , we obtain the following limits for Ei 


(26) 




< El < e'** |l.u.b. 1 e*** > , (h > 0). 


In a similar way upper and lower limits can be derived for Eo and Ei when ho < 
0. With the help of these limits upper and lower limits for a can be derived on 
the basis of equation (18). If ^ > 0 then Ei > 1, Eo < 1 and consequently 
the right hand side of (18) is a monotonically decreasing function of Eo and Ei. 
Hence if is a lower, and is an upper bound of Ei(i = 0,1), then 


(27) 


1 - < 


< a ^ 


1 - E'o 
E'l-E'o’ 


ik, > 0), 


In a similar way limits for a can be obtained when ho <0. If both the absolute 
value of Ez and the variance of z are small, Eo and Ei will be nearly equal to 
e**" and e’**, respectively. Hence, in this case a good approximation to a is 
given by the expression 


(28) 



The difference a — a approach^ zero if both the mean and standard deviation 
of z converge to zero. 


6. The characteristic function of n. Let be a random variable defined as 
follows: = a if > o and = i> if Z„ < b. Denote the difference Z, •*• Zn 

by €. Then c is a random variable. 

In what follows we shall neglect e i.e., we shall substitute 0 for c. No error 
is committed by doing so in the special case when z can take only two values d 
and —d and the ratios a/d and b/d are integers, sinoe in this case c is exactiy zero. 
Apart from this special case the variate c will not be identical with the constant 
zero. However, the smaller the values | Ez \ and Ez*, the smaller the error we 
commit by neglecting e. In fact, for arbitrary small potitive numbers ^ and <§ 
the inequality p( | e | < ii) > 1 — ia will hold if | £z | and Es? are sufficientiy 
small. Thus in the limiting case when Ez and E;? approach zero the random 
variable c reduces to the constant zero. 
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(a) . The ohoracteristic funcHm of n when only one of the guantUiee a aiul b iefirUte. 
It will be sufficient to treat the case when a is finite and b — oo, In this case 
n is defined as the smallest positive integer for which Zn > a. To make the 
probability of the existence of such a value n to be equal to 1 we have to assume 
that the expected value /u of 2 is positive. Since b — 00 , the fundamental 
identity (15) need not hold for all points t of the set D. However, it follows 
easily from (13) that (15) holds for all points t in D whose real part is non-nega¬ 
tive. Denote by ^(r) the characteristic function of n (r is a purely imaginary 
variable). Since Zn — a (neglecting c), and 

w)r = ^[-"iog^(0], 

identity (15) can be written as 

(29) log^(0] = 1. 

Let tir) denote a root (with non-negative leal part) of the equation in t 

(30) log (p(t) + r = 0, 
and substitute tir) for t in (29). Then we obtain 

(31) ^(r) = 

As an illustration let us calculate ^(r) in the case when 2 is normally dis¬ 
tributed. In this case 

log <p(t) * 

where m is the mean and <r is the standard deviation of 2 . Hence 

(32) 

If we take the + sign before the square root sign, the real part of i(r) is non¬ 
negative, since the real part of — 2<7®r is greater than or equal to fi. Hence 
the characteristic function of n is given by 

(33) ^(r) = (m > 0). 

(b) . The characteristic function of n when a and b both are finite. Given the value 
of n, let Pft be the conditional probability that Zn = a. Let pj denote the prob¬ 
ability that n is the smallest positive integer for which either Zn = a or Zn = b 
holds. Neglecting Zn — Zn , identity (15) can be written as 

(34) Z [p,c“‘ + (1 - p»)«“][v>(0]“"pJ: - 1. 

n«il 

Let ^i(r) be the characteristic function of n in the subpopulation where » a, 
and let ^(r) be the characteristic function of n in the subpopulation where 2. » 



dtniB ^ 


b,' Furthermore let ^(r) be the dmractedstiic funeticm n in the total pd^ 
lation. 

Since we neglect the difference 2fn — , it follows from (18) tibat the preib* 

ability a that 2n « a is given by * 


(36) 



Putting 1 — p, 
(36) 


qn the following rdations hold 

Z p»p!W)]- 

Hpnpl 


ll'lt—log <p{t)] - 


H Pnpt[y(0)~* 


(37) 


(38) 


Z7»P« 1-“ 

lAt —log <pit)\ =* Zp3»Iv»(0]~" = Z(pn + ?!»)[<»(<)]"" pt 


oil'll—log <p(01 + (1 — o)!!-!!—logv>(ft}< 


Putting — log^(0 = r we obtain from (34), (36) and (37) 

(39) o£^i(T)e®* + (1 — a)}lfi{r)e^^ = L 

According to Lemma 2 the equation --log (p{t) == 0 has two different real roots 
in = 0 and t = ho, and ^'(0) and <p'(ho) both are unequal to zero. Hence, if 
<p{t) is not singular at f = 0 and / = fto, the equation 

~log^(0 = T, 


has two roots /i(t) and ttir) for suflBciently small values of r such that lim <i(r) 

T^O 

= 0 and lim <s(t) = ho. Since the identity (15) holds for all values of t for 

r«>0 

which I (flit) I > 1, and since | v[<i(t)] 1 * j ^[<j(t)] | = 1 for all imaginary values 
of r, it follows from (39) that both equations hold 

(39') «^i(r)e''>'^> + (1 - «)^(t)c*“‘'' “ 1, 

(39") «^i(r)e“*« + (1 - «)^(r)c‘'*‘^> - 1. 


Solving these two linear equations we obtain ^i(r) and ^i(r). The character¬ 
istic function ^(r) is given by , 

^(r) = a^i(r) + (1 ~ a)th{r). 


As an illustration we shall determine ^i(r), ^(r) and ^(r) when z has a normal 
distribution with mean ^ and standard deviation ir. We have 

-log v(0 - 5 “ r- 
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Hence 

(40) 1-- 

Putting e* = A and e’’ = B we obtain from (39) and (40) 

(41) + (1 - «)»b(T)B-i‘/'*+‘'»*V?:^ 3. 1 

(42) a^i(T)i4-»‘''*-‘/»VM«-«»»T + (1 — = 1. 

These two equations are valid for any imaginary value of r. Since = 
we obtain from (35) 


-2n 


(43) 
Let 

(44) 
and 
(46) 


?i ■= — ^ ^ Vm* — 2(»*t, 


*= ~ ^ V^#** " 2a®T. 


Then \^e obtain from (41) and (42) 


(46) 
and 

(47) 


a\f'i(T) = 


(1 — a)}f>i(r) 




- ^'*15'’*’ 


A'‘F'* - A” S'* 
Hence the characteristic function of n is given by 

A'* + - A" - B*‘ 


(48) 


^(t) = 


A«b»» _ 


6. The distribution of n when 2 is normally distributed, (a) The case when 
a i« finite and h = —<*>. In this case the characteristic function of n is given 
by (33). Let 

2 

(49) “ £i”- 

Then the characteristic function of m is given by 
(80) ^(i) •= ^ 






whei« 


(61) 


Tlie distribution of m is given by 

(52) 

1 

2 irt JL^ 

Let 


(53) 

0(e, m) = i 

and 


(54) 


Since 


(55) 

1 d —mt ^ ^ I-4»I 

2 in<fl 2 irtV2 v'l-< / 

we have 


(56) 

1 H(c, m) - mO(c, m) = = 0. 

From (53) and (54) we obtain 

(57) 

aff fe ») ^ 0 

From (56) and (57) it follows that 

(68) 


Hence 


(59) 

log ff(c, m) = + log X(m) 

where X(f») 

is some function of m only. Thus 

(60) 

H(c, m) = X(»»)e“‘’'*". 

Now we 

shall determine X(ni). We have 


>'<’•)-®<o. 7= 


(61) 
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Since (1 — is the characteristie function of where x* has the x^-distrihu- 
tioQ with one degree of freedom, the right hand side of (61) is equal to 

r(i)Vm 

Hmce 

( 62 ) X(») - 

From (60) and (61) we obtain 

From (56) and (63) we obtain 


(64) 


0(c, m) = 


2r(i)m*/2 

Hence the distribution of m is given by 


( 66 ) 


F{m) dm 


2r(i)m*» 


dm, 


(0 ^ TO < oo). 


Let TO == 5 TO*. Then the distribution of m* is given by 
2 


jD(to*) dm* 


c»/2 


( 66 ) 


-a I 


8/2 C 

♦\«/2 


,-(c/2)(l/m*-Hn*-2) 


dm* 


(TO*)’ 


Vc 

V^{m*f* 


-(0/2) »/"••+«•-*) 


dm*. 


The function —j + Jn* — 2 is non-negative and is equal to zero only when to* = 1. 
fnr 

If c is large, then D{m*) is exceedingly small for values of m* not close to 1. 
Expanding + m* — 2 in a Taylor series around m* = 1, we obtain 


m 


1 


—2 + m* — 2 » (m* — 1)* + higher order terms. 


(67) 

Hence for large c 

(68) F>{m*) dm* 


Vc .-( 
-s/Sii 


(e/2)(m*-l)» 


dm*. 


i.e., if c is large m* is nearly normally distributed with mean equal to 1 and 
standard deviation —p^. 



cinnjL&«(!rs,'fDMs 


(b)< ThecaiKXBhmaaaibbo&turefinite. InthiseMetiMdiancteriBttefi^^ 
of is given by (48). Let i 


Tbm the efaaracteristic function of mios given by H . 
Ml . A*- + 


whe*e 


hi = d(l — Vl — 


d(l + VT^t), 


tuid t is an imaginary variable. Putting A, ^ B, da & and <fb * S, 
the characteristic function of m can he written as 

A?(e-«v1=« - ^Vi=i) + S(e*v^ - 

(71) _ ' 

ji(e—— g(4«—•}- 5(e*v^*^ — ^(i—ibVw) 

^5(1 - e»<«-»)vl=«) ' 

It will be sufficient to consider only the case when m > 0, since thn case < 0 can 
be treated in a similar Way. Then d < 0 and S > 0. Since the reid part of 
+\/l — t is greater than or equal to one, we have 

(72) 1 I < 1, 

for any imaginary value of t. Let 

(73) T - 


1 

1 - T 


rr'. 

j-0 


From (71) and (74) it follows that f*{t) can be written in the form of an infinite 
series. , 

(76) ' ^*(0 » 

where X, and r, are constants and X, > 0." ^ESjtih tbrm (rf this series is a character¬ 
istic function of the form given in (50) except for a pro^rtiwality factor. Let 
F,(m) be the distribution of m corresponding to the characteristic function 
gx,-x,>/n5. Then F,(»») can be obtained from (65) by substituting X 4 for c. ffince 
we may integrate the right hand dde p^mber of (75) term by term, the disr 
tribution of in is given by 

(76) F(m) dm - f‘^*"0 
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Siaoe m is a discrete variable, it may seem paradoxical that we dbtmned $, 
probability density function for m. However, the explanation lies in the fact 
that we neglected t = Zn — Zn and this quantity is zero only in the limiting case 
when M and v approach zero. 

If I I and are sufficiently small as compared with a and | 6 |, the distribu* 
tion of m given in (76) will be a goodi approximation to the exact distribution of 

m, even if z is not normally distributed. The reason for this can be indicated 
as follows: Let 

(77) z? = £ (f - 1,2, • • •, ad inf.) 

where r is a given positive integer. Since the variates are independently dis¬ 
tributed each having the same distribution, under some weak conditions the 
variates z *(i = 1, 2, • • • , ad inf.) will be nearly normally distributed for large r. 
Hence, considering the cumulative sums Zf = z* + z*+ * • • + 2 ? (t = 1,2, • • • , 
ad inf.), the distribution given in (76) is applicable with good approximation, 
provided that r | m | and x/ra are small as compared with a and 161 so that the 
difference €* « ZJ — Zj can be neglected. 

7« The exact probability distribution of Zn and the exact characteristic func~ 
tion of n when z can take only integral multiples 'of a given constant d. In the 

previous sections we derived the probability P(Zn > a) and the characteristic 
function of n under the assumption that the quantity by which Z« may differ 
from a or & is small and can be neglected. This can be done whenever | Ez | 
and Ez^ are small. However, \{\Ez\ or E;? is not small, it is desirable to derive 
the exact probability distribution of Zn and the exact characteristic function of 

n. Both are obtained in the present section for random variables z which can 
take only a finite number of integral multiples of a given constant d. This is a 
rather general result, since any distribution of z can be approximated arbitrarily 
fine by a discrete distribution of the above type if the constant d is chosen suf¬ 
ficiently small. 

There is no loss of generality in assuming tliat d = 1, since the quantity d 
can be chosen as the unit of measurement. Thus, we shall assume that z takes 
only a finite number of integral values. Let gi and be two positive integers 
such that P{z = —gi) and P{z = ^j) are positive and z can take only integral 
values > —jfi and < gt. Denote P{z = t) by hi . Then the characteristic 
function of * is given by 

(78) ^(0 * 23 

To obtain the roots of the ^nation ^(0 * 1, we piit e* * v and solve the equa¬ 
tion 


( 79 ) 


hi vf * 1 . 



CUUUIrATiyX BUMS 


Denote gi + gt^y 9 <um 1 let the g roots of (79) be ui ,•••,%, req)ectively. We 
shall assume tbat no two roots are equal, i.e., Ui 9 ^ uj for i 9 ^ j, Substitutihg 
Ui for e* in the identity (16) we obtain 

(80) Eiu*r)^l 

Denote by [a] the smidlest integer > a, and by [6] the largest integer < b. Then 
Zn can take only the values 

(81) (61-ffi+l. [b]-gi + 2,-Ab],[a], M + 1, • • • , [oj + p* - 1 . 

Denote the g different integers in (81) by ci, • • • , c,, respectively. Further¬ 
more, denote P{Z» » c<) by . Then equations (80) can be written as 

(82) = 1 (»•-1, •••.If). 

Let A be the determinant value of the matrix || uV || {i,j= 1. * * • . g) let 
Ay be the determinant we obtain from A by substituting 1 for the dements in 
the jth column. If A 0, it follows from (82) that P(Z» «= cy) (y is given by 


(83) 

Hence, P(Zn > a) = 2 (Ay/A) summed for all values of j for which cy > a. 

i 

From the probability distribution of Zn we can easily derive the expected value 
En of n. In fact, differentiating the fundamental identity (15) sdth respect to 
I at I = 0 we obtain 


(84) 



m "J 


0 . 


Since 




= Ez, we obtain from (84) 


(85) 


En 


EZ» ^ 2_ 

Ez Ez y_i A 


Now we shall derive the exact characteristic function of n. Denote by 
^,(r) (r is a purely imaginary variable) the characteristic function of the condi¬ 
tional distribution of n under the restriction that Zn =‘ Ct, Let li(r), ■ ■ ■ , <«(r) 
be g roots of the equation 


(86) 

^(0 “ 

such that 


(87) 

lime"*"’ - Ui 


Substituting U(t) for t in tiie fundamental identity (15) we obtain 


( 88 ) 


£ $y«*'‘“^V/(r) - 1 
>•-1 


(* - 1. • • • . ff). 
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These equations are linear in the. unknowns “ • > iti'*) the deter¬ 
minant of these equations is given 

••• 

••• 

(89) , ... «(r) 

^g«i‘<rO) .., 

Obviously, 8(0) = fife • • • feA. Hence if f,- 5^ 0 (i = !,•••, g) and A 0, 
also 8(0) 9 ^ 0 and consequently 8(r) 9 ^ 0 for any t with sufficiently small absolute 
value. Thus, if’i(r),'''^‘• V\^'»(r) can be obtained by solving the linear equations 
(88). The characteristic function ^(t) of the unconditional distribution of n 
is given by 

( 90 ) iir) 


\\ : .» .iv^' .... i(‘ -‘.vi 





SOME IMPROVEMENTS IN WEIGHING AND OTHER EXPERIMENTAL 

TECHNIQUES* 

By Harold Hotelling 
Columbia University 


When several quantities are to be ascertained there is frequently an oppor¬ 
tunity to increase the accuracy and reduce the cost by combining suitably in one 
experiment what might ordinarily be considered separate operations. The 
theory of design of experiments developed as a branch of modern mathematical 
statistics, and of which fimdamental considerations are set forth in R. A. Fisher’s 
book [1], provides many improvements of this kind. Since the main interests 
of Fisher and other originators of this theory have been in biology, the applica¬ 
tions so far made have been chiefly biological in character, excepting for certain 
economic and social investigations involving stratified sampling. The possi¬ 
bilities of improvement of physical and chemical investigations through designed 
experiments based on the theory of statistical inference have scarcely begun to 
be explored. 

The following example is due to F. Yates [2]. A chemist has seven light ob¬ 
jects to weigh, and the scale also requires a zero correction, so that eight weigh¬ 
ings are neceiisary. The standard error of each weighing is denoted by o’, the 
variance therefore by Since the weight assigned to each object by customary 
techniciues is the difference between the reading of the scale when carrying that 
object and when empty, the variance of the assigned weight is 2<7^ and its stand¬ 
ard error is <r \/2. 

The improved technique suggested by Yates consists of weighing all seven 
objects together, and also weighing them in groups of three so chosen that each 
object is weighed four times altogether, twice with any other object and twice 
without it. Calling the readings from the scale , • • • , j/s we then have as 
equations for determining the unknown weights a, 6, ■ - , g. 


a 

+ 

b + 

c + 

d 

+ 

C + 

/ 

+ 

a 

+ 

b + 

c 






a 



+ 

d 

+ 

e 



a 






+ 

/ 

+ 



b 

+ 

d 


+ 

/ 




b 



+ 

e 


+ 




c + 

d 




+ 




c 


+ 

e + 

/ 



2/1 

2/2 

2/3 

2/4 

2/5 

2/5 

2/7 

2 / 8 . 


5 Presented at the Wellesley meeting of the Institute of Mathematical Statistics, Aug. 
13, 1044. 
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Any particular weight is found by adding together the four equations containing 
it, subtracting the other four, and dividing by 4. Hius 

_ yi + +1/4 - Vi - ye “ »T - yi 

a =- 2 - • 

The variance of a sum of independent observations is the sum of their variances, 
as is well known, and the variance of c times an observation is c* times the 
variance of that observation. Taking c = } for the first four terms m the expres¬ 
sion for a and c — — i for the others gives for the variance of a by this method 
<r*/2, which is only one-fourth that for the direct method. The standard error, 
or probable error, has been halved. If a degree of accuracy is required calling 
for repetition a certain number of times of the weighings by 'the direct method, 
then only one-fourth as many weighings are needed by Yates' method to procure 
the same accuracy in the average. 

A further improvement, which does not seem to have been mentioned in the 
literature, will be obtained if Yates’ procedure is modified by placing in the other 
pan of the scale those of the objects not included in one of his weighings. Calling 
the readings in this case zi, • • • , 2$, we have 

o-hb + c-hd + e-t-/-|-y 
a + b + c — d — e—f — g 
a -b-c + d + e-f-g 
a — b-c — d — e + f+ g 
—a -^b — c-\rd — e-\-f — g 
-a + b — c — d-^e—f + g 

•MM ^1^ WMM ^ M^M ^ ^ ^ ^ 

From these equations, 

2i + 22 + «B + «4*“^i““*6— 

8 ' 

with a like expression for each of the other unknowns. The variance of each 
unknown by this method is <rV8. The standard error is half that by Yates^ 
method, or a quarter of its value by the direct method of weighing each object 
separately. The number of repetitions required to procure a particular standard 
error in the mean is one-sixteenth that by the direct method, 

A simpler example illustrating the same point is that of two objects to be 
weighed, with a scale already corrected for bias. Again let a be the variance 
of an individual weighing. If we weigh the two objects together in one pan of 
the scale, and then in opposite pans, we have as equations for the unknown 
weights a and b, 

a + h = a -- b Zif 


~ 2l 
= Zt 
^ Zi 
= Zi 
- Zi 
= Zi 
== 27 
= 28 



wmowsQ 


m 


whence 

a = (21 + 2»)/2, 6 « (21 — 2*)/2. 

The variances of a and b by this method are both equal to ff^/2, half the veine 
when the two objects are weighed separately. The means found from a number 
of pairs of weighings of sums and differences have the same precision as those 
found from twice as many pairs of weighings of the objects separately. 

Further economies of effort, or gains in accuracy, are possible with larger 
numbers of weighings and of objects to be weighed. These improvements can 
to some extent be applied also to other types of measurement, as of distances, 
since it is sometimes possible to measure the sum of a number of such quantities, 
or the difference between two such sums, with approximately the same accuracy 
as a single one of them. The outstanding case, however, seems to be that of 
weighing on a balance objects light enough so that their aggregate weight is 
below the maximum for which the balance was designed, since in this case it is 
quite reasonable to assume that the several recorded results all have the same 
standard error a and that they are independent. 

In what follows, some principles underlying the design of efficient schemes of 
this kind will be developed and applied to obtain some additional plans. How¬ 
ever no comprehensive general solution has been reached; this appears to be a 
matter for further mathematical research. Also, we leave aside in this paper 
the problem of estimating the error variance. All this discussion is based on 
the minimization of the actual variance. In order to utilize the results it is 
necessary that this variance be either known a priori or estimated from the 
residuals from the least-square solution. The latter type of estimate is in some 
ways more satisfactory, since it refers to the actual experiment rather than to 
some previous experiments which may not have been made under exactly the 
same conditions. But in order to have such an estimate it is necessary that the 
number of observations exceed the number of unknowns, and desirable that the 
excess shall have a large enough value to insure a stable estimate of the error 
variance <r*. The appropriate test for significance, or determination of confidence 
limits for the unknowns, must then utilize the Student distribution or its general¬ 
ization, the variance ratio distribution, which take full account of the instability 
caused by an inadequate number of degrees of freedom for estimating <r. 

It is only when a is known exactly apart from the experiment being designed 
that the criteria we here consider are exactly applicable. In other cases there 
may need to be a balancing, in the design of the experiment, between the de¬ 
siderata of minimum variance and of accurately known variance, with the accu¬ 
racy of this knowledge depending on the number of available degrees of freedom. 
A theory of design taking full account of this consideration would require a use 
of the power functions of the Student distribution and the variance ratio dis¬ 
tribution, discovered respectively by R. A. Fisher [3] and P. C. Tang [4]. 

We shall denote by N the number of weighings to be made, and by p the num¬ 
ber of objects to be weighed. In order that it be possible to determine the un- 
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known weights from the observations it is necessary that p ^ N, and if a possible 
bias in the scale must be eliminated by njeans of the same data it is necessary 
that p ^ N — I, Supposing these conditions to be satisfied, we shall show, 
among other things, that the minimum possible variance for one of the un¬ 
knowns is a/N ; that the experiment may be arranged so that a selected one of 
the unknowns has exactly this minimum variance excepting when N is odd and 
a bias must be allowed for also; and that for some, but not all, combinations of 
p and iV, this minimum variance is attained for all the unknowns simultaneously. 
This minimum value a^/N is of course equal to the variance of the mean of N 
weighings of one object alone, disregarding the rest; but it will be seen below 
that by complex experiments of the kind indicated, determinations from the 
same number of weighings of the other weights also can at the same time be 
made with some finite variance, which may or may not have the minimum value. 

The following notation will be used in the proof. I^et Xia — 1 or — 1 if the 
tth object is included in the ath weighing by being placed respectively in the 
left- or right-hand pan, and let Xia = 0 if the ith object is not included in the ath 
weigliing. Here i = 1, 2, • • • , p and a = 1, • * • , iV. Let ya be the result 
recorded for the ath weighing, let A* be the error in this result, and let 6,- be the 
true weight of the ith object, so that we have the N equations 

(1) ^lo^l "f” X2ahi Xpahp = ya H" Aa ; 

provided there is no bias, or if by ya we mean the observed weight corrected for 
a bias known a priori. Under these conditions the estimate of each of the 6/s 
having the properties of zero bias and minimum variance is that provided by 
the method of least squares. This statement, which docs not depend on any 
assumption of a normal or other particular form of distribution of the errors, 
has been known long but not widely, since there is an easier derivation of the 
method by the application to the normal distribution to the method of maximum 
likelihood. Its proof, due originally to Laplace,' has appeared in many forms 
in the work of Gauss and later authors [5]; the latest version is by the present 
writer [6]. 

Letting S stand for summation over all the N weighings we put 

(2) aij = SXiaXja , Qi = SXiaya , 
and write the normal equatioas in the form 


where ? stands for a sum with respect to j from 1 to p. From the usual theory 
of least squares (cf. for example the reference last cited) it is known that the 
standard error of the determination of bi from these equations—which is the 
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minimum possible standard error of bi for any way of combining the observa¬ 
tions—is 9 times the square root of Ah/A, where ‘ 

Oj! • • • flip 
fli* • • • dtp 

0/p2 • * • 

apd All is the minor of A obtained by deleting the first row and column. 

The matrices of A and of An are known to be positive definite or semi-definite. 
The semi-definite case is excluded by the consideration that the normal equar 
tions shall actually determine the unknowns. Hence the inverse of the latter 
matrix exists and is positive definite. But this inverse, which we may write 

P (1^2 •' * d2p 

9 

dpp. 


di^anajip 

i.?-2 

which is obtained by expanding A with reference to its first row and first column. 
The positive definite character of d therefore leads to the following 
Lemma: // Ow , • • • , aip (= 021 , • • • , a^i respectively) are free to vary while the 
other dements of A remain fixed y the maximum value of A/An is an , and is attained 
when and only when an ^ an = • • • = aip = 0. 

From this it is evident that the variance of 61 , namely An/A, cannot be less 
than o^/an , and will reach this value only if the experiment is so arranged that 
the elements after the first in the first row and column of A are all zero. That 
such an arrangement is possible may be seen by a consideration of the matrix 




consists of the coefficients in the identity 

A/All = Uii ^ 


an 

A - 

|a^i 


” Xn X 21 

* * * Xpi 

Xi2 • • 

• Xp2 

. XlAT • • 

• Xpj^^ 


whose elements are restricted to be Fs, —rs and O's. The condition an - 0, 
by ( 2 ), means simply that SxiaXia = 0 , a condition which may be expressed by 
saying that the first column of X is orthogonal to the tth colunrn. The condi¬ 
tion' that the variance of bi have its minimum value o^/an is thus, according to 
the lemma, that the first column of X shall be orthogonal to all the others. The 
minimum minimorum of this variance will be reached if the first row of X is 




302 


HAROLD HOTBLLJNO 


not only orthogonal to all the others, but consists entirely of I’s and — I’s, so 
that Oil “ N. The value of this minimum minimorum is a^/N, 

If there is a possible bias to this procedure needs to be modified by the addition 
of bo to the left member of (1) and subsequent treatment of this term like the 
others, putting xo. = 1 in (2), and modifying X by adjoining a column of I’s. 
The necessary and sufficient condition that the variance of bi shall equal v/N 
is then that the column 


*u 

Xu 


XlM 

shall consist entirely of Ts and — I's and shall be orthogonal to a column con¬ 
sisting of 1% and to all the other columns of X. 

If no bias needs to be eliminated the experiment can be arranged so that the 
variance of bi is a^/N merely by filling up the first column of X with I's and 

— Ts in any arbitrary manner, and then choosing the later columns so as to be 
orthogonal to this first one, and so that all are linearly independent. This can 
be accomplished, for example by choosing the first element in all the columns to 
be the same as that in the first column; choosing the ith element in the ith column 
(t = 2, 3, • • • , p) to be the negative of the tth element in the first column; and 
making all the other elements of X zero. 

When a bias is to be eliminated, so that there is a column of Ts in X corre¬ 
sponding to bo, it is necessary that N be even in order that the column of X 
corresponding to bi may consist of Ts and — Ts in equal numbers, without any 
O’s, a condition essential for the orthogonality between these two columns with 
the maximum value N for Uij. Supposing N even, let us assign the value 1 to 
each of the fimt N/2 elements of the column corresponding to bi and the value 

— 1 to the last N/2 elements of this row. The remaining rows of X may then 
be filled up by the same method as that indicated above for the case in which 
there is no bias. The variance of bi will then take its theoretical minimum 
value (t^/N, 

If N is odd and there is a possible bias, the column of X corresponding to bi 
can be filled up with Ts and — Vs in equal numbers, with a single zero, and the 
remaining columns can be made orthogonal to it. The variance of bi in this 
case will be a /{N — 1). 

The method suggested above for filling up the later columns of X is convenient 
for the proof, but is not usually to be recommended in practice, since other 
methods will in all but the simplest cases give smaller standard errors for the 
unknowns other than the first. For some values of N and p it is possible to 
determine all the unknowns with equal and minimum variance. These are the 
cases in which all the columns of X can be made mutually orthogonal and 
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wi^out leros, excepting that the column corresponding to b» may contain some 
seroB. Thus for AT 4 tire scheme of weighing represented byihe matrix * 

1111 ' 

1 1 - 1-1 
1-1 1-1 
.1 -1 -1 1 . 

whose colurzms are all mutually orthogonal, may be applied to weigh three ob¬ 
jects when there is a possible bias, or four where there is not, with variance 
9*/A for each of the unknowns in either case. The matrix X'X of the normal 
equations has the form 

'4 0 0 0" 

0 4 0 0. 

0 0 4 0 
_0 0 0 4_ 

Calling the results of the weighings yo, y\, yt, yz in the case of possible bias we 
have for the unknowns the expressions 

hi - (yo + yi- Vi - 2 / 8)/4 

h% ^ {yo - yi +y 2 - 2 / 8)/4 

bs = (2/0 - 2/1 - 2/« + 2/«)/4. 

The complete orthogonality exemplified by this design has several advantages 
besides the fact that the variance of each of the unknowns has the same mini¬ 
mum value as if all the weighings were to be devoted to it alone (or half the value 
of the variance of this unknown if half the weighings were devoted to it plus 
bias and half to determining the bias). The diagonal form of the matrix X'Z 
means that the labor of solving normal equations, which is sometimes formidable, 
is reduced to the trivial task of dividing by N, Also, the diagonal form of this 
matrix implies that its inverse is also of diagonal form, from which it follows 
that the estimates of the different unknowns are statistically independent. Con¬ 
sequently the variances, or standard errora, of linear functions of the unknowns 
are easy to find. Thus the variance of the difference between the estimates of 
two of the weights is simply the sum of their variances. But of course if the 
main object of the experiment is to determine a particular difference of this 
kind, or any other linear function of the weights, a different design should be 
sought to minimize the particular variance which is of interest. 

In contrast to the satisfactory design possible with four weighings, no complete 
orthogonality is possible with six weighings, or with any odd number, if the 
number of objects to be weighed is the maximum possible for the number of 
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weighings and if each object is actually to enter into each weighing in one pan 
or the other. For iV = 3 and bias known to be zero consider the scheme 


X = 


1 

1 

1 



f 


which corresponds to weighing two objects, first together in one pan, then in 
opposite pans, and then weighing one alone. Calling bi the weight of the object 
that has been on the scale through all three weighings and bi the other we have 
the estimates 


bi = (yi + y 2 + 2 / 8)/3 

bi ^ (yi- yi)/2, 


with respective variances 


2 2/0 
(Ti = (T /d. 


2 

(Ti 


= ffV2. 


Thus the first weight is determined with the minimum possible variance but the 
second is not. 

An alternative method of weighing under these same conditions is to weigh 
both objects in one pan together twice and to weigh them in opposite pans once. 
This gives 


X = 


1 

1 

1 



> 


with the normal equations 


36i + bi = yi + yi + yt 

hi + 3&2 = + 2/2 — 2/8, 

whose solution is • 

bi = ( 2/1 + 2/2 + 22 / 8)/4 
^2 = ( 2/1 + 2/2 - 22 / 8 )/ 4 , 

and variances 

2 _ 2 _ a 2 

OTi = a2 = #0- . 

Thus the weights are by this method determined with equal accuracy, which is 
better than by the preceding method for one of the objects but worse for the 
other. To choose between the two methods it is therefore appropriate to take 
into consideration the relative accuracy desired in the weights of the two objects. 
Either method is better than weighing the objects separately. 

Either of these two X matrices can also be made the basis for weighing a single 
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object when tiie sci^ is-suspected of having a bias. Ibe weight this obj^ 
will be estimated as 6i, and will have the variance i<r* by the first metiiod, or 
fff* by the second. Thus the second method is distinctly superior in this case. 

Orthogonality between columns obviously requires both negative and positive 
signs, corresponding to weighings in both pans of the balance. Thus the ex¬ 
perimental designs of maximum efficiency for weighing on a balance are not 
available with a spring scale, or in making measurements of any kind in which 
it is not possible to arrange that the quantities read off are differences. In such 
cases the elements of X are restricted to be 1 or 0. Let us now consider some of 
the simplest cases of this kind, assuming for simplicity that a — We shall 
deal only with cases in which there is no bias. 

For N = S,p — 2 the simple experiment of weighing one object twice and the 
other once yields variances J and 1 respectively. All other designs are in this 
case less satisfactory, with the possible exception of that specified by 


X = 


1 

1 

0 


1 

0 

1 . 


with bi = (y, + 2yi — ysVS and hj = (j/i — y* + 2yt)/3 having each the vari¬ 
ance of |. 

For N = 3, p = 3 the most efficient design is given by 


X = 


1 

1 

0 


1 0 
0 1 
1 1 


with bi = (yi + yt — y^l2, and bi and bt given by cyclic permutation in this 
formula. The variance of each unknown is |. 

For A" = 4, p = 3 a design having an advantage in some situations is that 
given by 


X = 


1 

1 

1 

0 


0 1 
1 0 
1 1 
1 1 


(together of course with those obtained by permutations of rows and of columns, 
as is to be understood throughout). The normal equations are 

3?)i -|- 26s + 2b) = yi -b J/s + 


26 i + 3bi + 2b) = yt + y) + yt 

2bi + 2bt + 3b) = yt + V) + Vi. 

An expeditious method of solution in this as in many similar cases is to add them 
all together and then subtract an appropriate multiple of the sum from each 
of the normal equations in turn. The variance of each unknown found by this 
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e}Q)eriiD^t is. 5/7 » .714. The simple experiment consisting d w^;hing one of 
the ohjeots twice and the others once each yields variances in one case larger 
and. in two .cases smaller than this. 

For iV » p n 4 the cyclic arrangement 

' 1110 “ 

1 1 0 l| 

10 11 ; 

.0 1 1 ij 

leads to variances all equal to 7/9. 

For a 5, p s 2 the most efficient design appears to be 

T 0' 

0 1 
1 1 
1 1 
1 1 

m 

The variance of each unknown is in this case 1/3. 
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ON THE ANALYSIS OF A CERTAIN SIX-BY-SIX FOUR-GROUP 
LATTICE DESIGN! 

By Boyd Harshbargek* 

Virginia Agricultural Experiment Station 

1. Introduction. The lattice consists of' groups of randomized incomplete 
blocks mth certain restrictions being imposed on the randomization within each 
group, and the number pf varieties is a perfect square. For example, if the num¬ 
ber of varieties is fc* = 36, then the orthogonal groups for a triple lattice, not 
considering randomizing within the blocks or between blocks, are as follows: 
(the numbers signify varieties). 


GROUP X GROUP Y 


Blocks 







Blocks 




(1) 

1 

2 3 

4 

5 

6 


(1) 

1 

7 

13 19 

25 

31 

(2) 

7 

8 9 

10 

11 

12 


(2) 

2 

8 

14 20 

26 

32 

(3) 

13 

14 15 

16 

17 

18 


(3) 

3 

9 

15 21 

27 

33 

(4) 

19 

20 21 

22 

23 

24 


(4) 

4 

10 

16 22 

28 

34 

(5) 

25 

26 27 

28 

29 

30 


(5) 

5 

11 

17 23 

29 

35 

(«) 

31 

32 33 

34 

35 

36 


(6) 

6 

12 

18 24 

30 

36 






GROUP Z 








Blocks 













(1) 

1 


8 

15 

22 


29 

36 





(2) 

2 


9 

16 

23 


30 

31 





(3) 

3 


10 

17 

24 


25 

32 





(4) 

4 


11 

18 

19 


26 

33 





(5) 

5 


12 

13 

20 


27 

34 





(6) 

6 


7 

14 

21 


28 

35 




This design is constructed so that no variety appears with another variety 
more than once in the same block. This important characteristic makes the 
analysis simple, as it enables the results to be treated as a factorial design. The 
analysis is well described by Yates [3, 4, 5] and Cochran [1]. 

Suppose another group, U, is now formed from a six by six lattice, for example, 
the following group: 

> Certain of the ideas presented here are embodied in the author’s unpublished doctoral 
thesis by the same title, Library, George Washington University, Washington, D. C., 1943. 

' The author wishes to express his appreciation to W. G. Cochran of Iowa State College, 
who advised freely in the preparation of the original thesis, and to Frank M. Weida of 
George Washington University. 
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GROUP U 


Blocks 


(1) 

31 

26 

21 

16 

11 

6 

(2) 

1 

32 

27 

22 

17 

12 

(3) 

7 

2 

33 

28 

23 

18 

(4) 

13 

8 

3 

34 

29 

24 

(6) 

19 

14 

9 

4 

35 

30 

(6) 

26 

20 

15 

10 

6 

36 


The important characteristic, that no variety appears with another variety in 
the same block more than once, does not hold. For example, varieties 1 and 22 
appear together in both groups Z and U. 

It is the purpose of this paper to develop the statistical method for the analysis 
of such a design, where each group is duplicated, and to apply the results to an 
actual problem. The least square solution, as developed here, uses only the 
intra-block information to correct the varieties for block effects. In a second 
article the solution using both intra- and inter-block information will be given. 


2. Estimation of the block and varietal effects. It is reasonable to assume 
in varietal trials that the general mean, and any particular block and variety 
effects, operate additively to produce the true mean of y associated with this 
block and variety. In particular, if 2 /«,y is the yield of the plot for the ^th variety 
in the ith block of the eth replicate,the following hypothesis may be set up, 
namely; 


( 1 ) 


Veij = M + P« + + Vj + €ei3 . 


Where n is the true or population mean yield, pe is the population replicate effect 
of the eth replicate, fid is the population block effect of the ith block in the eth 
replicate, Vj is the population variety effect of the ^th variety, and €*,; is the ex¬ 
perimental error of the eij plot. Since the design has eight replicates, that is, 
each group is duplicated,the block effects may be estimated from unpaired and 
paired replicates or partners. 

It is assumed that the c«iy are independently and normally distributed with 

common variance ^ . Without loss of generality, it also may be assumed that 

the sum of the replicate effects, the sum of the block effects within any replica¬ 
tion, and the sum of the variety effects are respectively equal to zero. 

The parameters are estimated by the method of least squares, subject to the 
restrictions stated in the preceding paragraph. This involves choosing the 
parameters so that 


^ I T 4 X 2 € ““ -S- ““ 


( 2 ) 




V 

- Xui - VjXtA 


2 2 
8 8 6 r rf 8 8 r i •• 

+ X, 2 r. + E x> r + £ X. E + X. E V, 


e-1 t-1 




J-1 


2 


2 
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is a minimum.’ Here is the dependent variate, and a;i, • • • , ^ 5 ^ are the 
independent variates. In ordinary regression problems, the values of the x 
variates, as well as the y variate, constitute a part of the originsd data. How¬ 
ever, in this case the y variate only is given, and the x variates must be con¬ 
structed. Thus, for the design, one takes xi = 1 for all values; Xie — I for all 
values in replicate e, but sero elsewhere; xui = 1 for all values in the ith block 
of the cth replicate and —1 for all values in its partner, but asero elsewhere; 
xui = 1 for all values in the ith block of the eth replicate and also 1 in its partner, 
but aero elsewhere; and x%j = 1 where variety j occurs, but zero elsewhere. 

One now takes the partial derivatives of the above equation with respect to 
the parameters and forms the normal equations. It can be shown that 
(Xi, • • • , X 4 ) are each zero. The normal equations not involving X^s are: 


Leading term 


(3) 


m 








e-1 t-i 


8 6 1 , T / 86 

+ 2itZZ‘ii^ + 8Ev,.o, 

e.l t.l iS y-l 


6 86 

kttn + fc*r, + k ^ ^ vj R,. 

i-i i-i 

k(u - U) + 2k = B.i - B'.i, 


h “I" 1 / 

Equations having ~ as leading terms. 


Equations having Vj as leading terms. 


In the above, N is the total number of values, k is the number of plots in a 
block, Te is the eth replicate effect, bei is the tth block effect in the eth replicate, 
Vj is the jth variety effect, G is the total sum of all values, Re is the sum of the 
values in the eth replicate, Bei is the sum of values in the ith block of the eth 
replicate and Vj is the sum of the yields of the ;th variety. The primes denote 
similar values of the partner terms. 

Using the restrictions 


(4) 


8 

Zr. 

49 ■■1 


8 6 T 

= EZ- 

i-1 


-it = E E — 

2 .-1 <-i 


bU 


= Z f/ == 0, 

j-i 


the values of the following parameters are directly obtainable: 

G R» 0 bri — b',i Bti — B',i 


( 6 ) 




R* _ 0 j 


r. 


/ 

r. 


2k 


• 8, for summation, will be used to represent summation over all values. S will be used 
in a more restricted sense. 



310 


BOYD HAS8BBABOBB 


h n • 

The values of the —- and p< effects cannot be obtained directly. In 

order to simplify the solution of these equations, only the mean, confounded 
blocks, and the varietal effects will be used. The results later will be corrected 
for replicate effects. If each of the yields is added to the corresponding yield 
in its partner, one gets an equation of the form 

(6) yik = M 4* jS** + >'* + 

where fit , and Vk are now twice their original values. These parameters are 
estimated by the method of least squares subject to the restrictions previously 
given. To distinguish them from the estimates derived in equation (3) they are 
designated with double primes (")• If Six is the total for block i in group X, 
that is from both pairs, and similarly for the other groups, the normal equations 


are: 

Leading term 

m" 144m" (7. 

bx\ 6m" + + (vi + V 2 + + v': + v,' + V,') - Bix . 

bu^ 6m" + + {vi + v/o + Vn + vio + vii + Vu) — . 

vi 4cm!* + bx\ + byi + bti + h^2 + ^Vi =* Fi. 

Vge 4m" + 6^6 + by^ + bgi + buz + 4t;86 == Fjm 


Let T»i be the total of all the varieties appearing in block 1 of group X from all 
the replicates; Tyi the total of all values appearing in block i of group Y from all 
the replicates, etc. Also 

(8) Cxi = 4B,< — Txi Cyi = ^Byi Tyi otc. 


Solving the equations: 


18 


Uyi 



and 

b'/l = TiT[(26ai + 0.8 + c^) + 3(C.2 + + CM 


(9) 


h'lz = Tiv[(26C^ + Cu + C^) + 3(C*i + + Cuz)] 


buZ = Ti¥l(26CM6 + C«2 + Cuz) + 3(C,1 + Cat 4- Cat)]. 


The values of &'"s calculated as above contain the replicate effects. To cor¬ 
rect for this, adjust the values so that the sum of the block effects for each 
replicate is zero. From the corrected V* values and the normal equations with 
the w'"s as leading terms, the corrected varietal sums are calculated. These are: 

(10) ^v! 4- 4m" = Vi — (sum over blocks in which Vj appears) 

where iv! + 4m" is the corrected varietal total for the jth variety. 
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8. Test of o^ ^iflc a n ce th« analyiJs of rarianoe. If the «’s have the 
previously defined values, the following identity occurs: 

V - mG + E r, + i: z - B',,) 

(11) + g g ^ ^ *•** (B^ + b:,) + g »jy, 

+ s(» - - r.a, - 

In th^ equation (11) 

m(? + i: r.f?. + E E (5,. - bU) 

, , «-i 0-1 m 2 

8 6 /T , s / V 86 

+ £ £ (k+w (B„ + b:,) + 2 .,r,. 

«-l M 4 M 

is the reduction in the sum of squares due to regression and 

q/.. ^ ^ ^ 

(13; o I y “ tnxi — -^— xu —-^- 

is the residual sum of squares. The reductions attributable to 

11 ~ <B« - b:.) 

«_i <.i Jt 

and 

(14) i t ik + k) (B„ + B'j, 

will be designated as component (a) and component (b) respectively. The 
residual mean square will be denoted by s*. 

The common test required is that of the null hypothesis that the variety effects 
fi, tit, ■ ■ ■ , vm are all zero. This test is made by calculating the reduction (Rt) 
to the sum of squares on all variates, and the reduction (Rv) due to the regression 
on all variates, except the A^riety effects. Rt — Rv is called the additional 
reduction to the sum of squares due to the v’s after fitting the renmining variates. 

The ratio {Rt — R,)/{82 — 47)** is distributed as F, as shown by Yates (6), 
with 35 and 205 degrees of freedom. The 35, 205, 82, and 47 degrees ol freedom 
pertain respectively to varieties, error, all constants fitted, and the total constants 
less the constants for varieties. 

Referring to formula (11) and the parameter effects, the sum of squares in the 
“Analysis of Variance Table” follow directly for replicate and component (a). 
Nair* in his recent article gives in detail the method for getting out the reduction 
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to the sum of squares for the entangled components. He shows that the reduc¬ 
tions for component (b) and the varieties may be written as: 


(16) 


gZECCrt and 

O §mmX t'wl 


8 N 


where the 6" have been corrected for replicate eflfects. It is well to note that 
b ^ ^ C,t is the reduction due to intra-block effects, freed of varietal effects. 

O «mmx <—1 

The reduction due to varieties corrected for block effects is given by 



This reduction can be used for testing the variety effects. 


ANALYSIS OF VARIANCE TABLE 


Replicate 

DIF 

(8-1) 

8 n2 ^2 

^ Tr 

Component (a) 

4(6-1) 

1 y ^ (5..- - bU)^ a (ft. - R:r 
sssa 12 72 

Component ( 6 ) 

4(6-1) 

gttCc- 

O emmx imml 

Varieties 

{ignoring blocks) 

(36-1) 

86 T72 r,2 

_ (r 

h S N 

Ettot 

205 

obtained by subtraction 

Total 

8(36)-l 

Sy^-^ 


4u Standard error of adjusted varietal means. For obtaining the standard 
error of the difference between two varieties adjusted by the intra-block informa¬ 
tion, this difference between two varieties can be expressed as a linear function 
of the plot yields. The standard error of the difference then can be obtained 
from the standard error of a linear function. To obtain the coefficients, it is 
well to draw a sketch of the plots, and put the coefficient of each plot on the 
diagram. In this way the proper multipliers can be found in a convenient 
manner. 

First consider the case where the two varieties appear together in the same 
block in both groups Z and U. One such pair consists of varieties (1) and (22), 
for which the varietal effects are designated by vi and V 2 » From equation (10) 
we have; 

(17) 4oi' = Vi- 4«i" - C - by\ - b'.\ - 

4Vit =Vn- Am" - C - - &«> • 
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Hiftiiilear fiulctian of the dfSereDoe betwe^ the varie&l efliMits is: ‘ ‘ ’ 

(18) “'V») ~ h»4 + i>w ~ iiii 

ynhete ' ■ • 

1 (45^ - r.i) (45 m Tm) ■ 

Ki - ^ (45m - I’m) ft"* - ^ (45 m - Tw)- 

The multi[diecs (e^^cept for the common factor 4 shown on the left of equation 
(18)] are: 


Nurriher- iyf Plots 

Multipliera 

CorUribtUion to vai'iancs 

4 

.3*+ 2 

3* 

36** + 48* + 16 
IF(9*^) 

A 

3* - 2 

36** - 48* + 16 

** 

3* 

TF(9*») 

A 

. 4 

64 

% 

3k 

W(!»¥) 

4(ib ■+ 2) 


36(* - 2) 
ir(9**) 

12(fc + 2) 

"^3* 

12(* - 2) 

Tf(9**) 

Total ■ 


72** + 48* 

t: 


Tr(9**) 


The variance per plot of the difference between two varietal means for varieties 
which occur t(%ether. in the same block in groups Z and U is: 


(19) 


and for A; » 6 is 


72jfc* + 48* 
2^ir(16)(9*) 


3* +2 
l2TrA: 


5 


18F’ 

Similarly the variance per plot of the difference between varietal means for 
other combinations are as follows: 


Combination* formula for k b 


both in same block in 

7 

groups Z or U 

24ir 

both in same block in 

8 

groups X or Y 

27Tr 

not together in the 

67 

same block 

216Tr' 
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6. Numerical Amdytie. (a) The data. In order to illustrate the appUoation 
of the method developed, an experiment used to test the yields of 36 hybrid com 
varieties is presented. This experiment was carried out on the Arlington Ex¬ 
perimental Farm, and the results are used through the courtesy of A, E. Brandt* 
and M. H. Jenkins/ 

Esmept for randomization, the plot yields are as shown in tables I to IV. 
(b). CalctdaHan for analysis of variance table. From page 9, the total sum of 
squares, the sum of squares for replicates, and the sum of squares for varieties 
ignoring blocks are obtained by substitution. They are: 


and 


where 


(90.9)* + (81.4)* + • • • + (101.0)* - c * 33,546.92 

(3291.8)* + (3300.3)’ + • ■ • + (2978.2)* _ ^ _ 2,289.6j 
36 

(741ii)- + (6M.6)‘+- - + (743.1)‘ _ ^ ^ 

8 


(26,935.9)* 

288 


2,335,662.80 


The block sum of squares, eliminating varieties, is made up of two parts, 
component (a) and component (b). From the formula on page 9 the reduction 
for component (a) is: 


(559.2 - 640.2)* + (547.0 - 522.4)* + • • • + (516.8 - 507.7)* 

12 

(3291.8 - 3300.2)* + (3266.5 - 3304.7)* 


Component (b) consists of differences giving an estimate of block yield freed of 
varietal effects. The C's are first calculated by using formula (8) and the 
results are as follows: 


C.1 (4B,i - Tsi) * 4(1,099.4) - 4203.2 = 194.4 etc. 

The 6^8 are calculated by using formulas given by (9), and then correcting for 
replicate effects by imposing the conditions that 

2) 4 *= 0 (e a: • • • tt) 

The corrected are: 

_ bi » 6. 79556, 6" = -3.83777, • • • 5m - -0.34306. 

^ Acting Chief, Conservation Experiment Station Divisions, Office of Research Soil Con¬ 
servation Service, U.S.D.A. 

* Principal Agronomist. In charge of Corn Investigations, Bureau of Plant Industry, 
U.8.D.A. 



TABmi 
GJlOtP X 


Ptei ¥Md BeptiaOe 1 2 


1 

90.9 

2 

81.4 

3 

95.4 

4 

96.9 

5 

96.6 

6 

98.0 

659.2 

7 

96.0 

8 

92.9 

9 

86.3 

10 

96.4 

11 

84.5 

12 

90.9 

547.0 

13 

79.3 

14 

97.5 

15 

95.9 

16 

80.2 


18 

112.1 

566.8 

19 

83.4 

20 

92.6 

21 

91.2 

22 

88.3 

23 

71.1 

1 

24 

111.8 


25 

93.7 

26 

86.1 

27 

68.0 

1 

28 ' 
109.2 

29 

94.9 1 

30 

93.3 

545.2 

31 

86.3 

32 

81.2 

33 

99.6 

34 

86.9 

35 ' 

93.4 

36 

88.9 

535.2 

2 528.6 

531.7 

536.3 

Plot Yie 

557.9 

'Id Replicxd 

542.3 

e V 

595.0 

1 

3291.8 

1 

105.8 

2 

83.5 

3 

68.6 

4 j 

99.0 

1 

5 

91.8 

6 

91.5 

540.2 

7 

98.6 

8 

71.9 

9 

81.0 

10 

98.6 

11 

90.3 

12 

82.0 

522.4 

13 

70.6 

14 

90.4 

15 

86.2 

16 

88.8 

17 

99.7 

18 

113.5 

549.1 

19 

94.6 

20 

89.6 


22 

99.2 

23 

73.2 

24 

90.9 

554.3 

25 

91.6 

26 

86.9 

27 

74.6 

28 

104.1 

29 

102.3 


555.5 

31 

91.1 

32 

95.1 

[ . 

33 

98.8 

34 

95.5 

35 

88.1 

36 

110.2 

578.8 

2 652.2 

516.4 


585.2 

545.4 

684.1 

3800.3 
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TABLE II 
GROUP Y 


PhA Yield BepUeate i 2 


1 

85.9 

7 

96.6 

13 

85.4 

19 

86.4 

25 

88.8 

31 

89.7 

532.8 

2 

87.1 i 

1 

8 

83.5 

14 

86.9 

20 

88.1 

26 

89.1 

32 

102.7 

537.4 

3 

88.4 

9 

85.5 

15 

90.4 

21 

99.7 

27 

80.5 

33 

100.2 

544.7 

4 

88.5 

10 

72.3 

16 

89.5 

22 

95.6 

28 

105.5 

34 

82.6 

534.0 

5 

87.0 

11 

88.1 

17 

101.8 

23 

82.5 

29 

86.8. 


546.3 

6 

90.8 

12 

83.4 

18 

101.2 

24 

120.2 

30 

72.6 , 

36 

93.1 

561.3 






1 

3256.5 


Plot Yield Replicate 2' 


1 

95.1 

7 

96.0 

13 

85.4 

19 

74.3 

25 

93.0 

1 

31 

86.7 

530.5 

2 

96.5 

8 

84.2 

14 

79.0 

20 

88.1 

26 ^ 
95.6 

32 

101.3 

544.7 

3 

95.4 

9 

83.3 

15 

95.9 

21 

99.0 

27 

66.0, 

1 

33 

105.3 

544.9 

4 

81.4 

10 

95.0 

16 

91.6 

22 

101.4 

28 

108.5.. 

34 

90.5 

568.4 

5 

91.8 

11 

107.6 

17 

84.2 

23 

66.1 

29 

87.6,. 

35 

101.6 

538.9 

6 

92.2 

12 

89.6 

18 

104.1 

24 

116.0 

30 

81.5 

36 

93.19 

577.3 





* 

t * 

. 

3304.7 
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TABLE IH 
GBOUP Z 


Plot Yidi RepUeaU S 2 


1 

87.3 

8 

79.9 

15 

86.9 

22 

105.0 

29 

99.4 

36 

90.3 

548.8 

2 

90.0 

9 

93.0 

16 

93.1 

23 

90.3 

30 

73.9 

31 

94.1 

534.4 

3 

91.2 

10 

92.9 

17 

90.6 

24 

92.3 

25 

76.0 

32 

97.2 

540.2 

4 

87.8 

11 

98.2 

18 

125.1 

19 

102.1 

26 

88.3 

33 

98.1 

j 

599.6 

5 

84.3 

12 

93.7 

13 

73.9 

20 

84.4 

27 

66.7 

34 

89.8 

492.8 

6 

90.1 

7 

91.3 

14 

85.4 

21 

95.4 

28 

93.7 

35 

101.6 

567.5 



Plot I 


■1 


3273.3 

1 

99.4 

8 

83.5 

15 

70.9 

22 

99.9 


36 

69.0 

626.7 

2 

92.2 

9 

96.0 

16 

98.1 

23 

74.7 

30 

89.8 

31 

103.6 

554.4 

1 

3 

91.9 

10 

106.4 

17 

92.7 

24 

108.3 

25 

88.8 

32 

103.4 

591.5 

4 

99.0 

11 

105.4 

18 

101.2 

19 

78.1 

26 

90.5 

33 

98.1 

572.3 

5 

63.7 

12 

94.4 

13 

80.7 

20 

78.4 

27 

71.3 

34 

86.9 

476.4 

6 

81.5 

7 

88.6 

14 

90.4 

21 

84.1 

28 

93.7 

35 

88.9 

627.2 







3246.6 
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TABLE IV 
GROUP U 
Plot Yield Replicate 4 


31 

98.5 

26 

99.3 

21 

97.6 

16 

90.9 

11 

102.5 

6 

91.5 

580.3 

1 

96.9 

32 

92.3 

27 

70.0 

22 

99.2 

17 

85.6 

12 

100.6 

543.6 

7 

77.8 

2 

79.9 

33 

98.1 

28 

104.8 

23 

88.9 

18 

116.7 

565.2 

13 

78.6 

8 

81.3 I 

i 

3 

84.1 

34 

86.9 

29 

91.3 

24 

111.8 

534.0 

19 

82.6 

14 

89.0 

9 

75.0 

4 

94.1 

35 

108.3 

30 

96.7 

546.7 

25 

87.3 

20 

76.2 

15 

89.7 

10 

83.7 

5 

82.2 

i 

36 

96.7 

516.8 







3284.6 


Plot Yield Replicate 4' 2 


31 

83.1 

26 

88.3 

21 

84.8 

16 

94.5 

11 

73.6 

6 

84.4 

508.7 

1 

80.9 

32 

82.6 

27 

73.9 

22 

94.8 

17 

76.8 

12 

83.4 

491.4 

7 

93.3 

2 

85.0 

33 

100.2 

28 

87.8 

23 

83.2 

18 

94.7 

544.2 

13 

74.6 

8 

68.2 

3 

45.2 

34 

79.0 

1 

29 

87.6 

24 

102.1 

456.7 

19 

69.1 

14 

81.2 

9 

80.3 

4 

68.8 

35 

83.7 

30 

86.4 

1 

469.5 

25 

89.5 

20 

71.7 

15 

75.1 

10 

97.1 

5 

73.3 


507.7 







2978.2 
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To get the reduction due to comp<ment (b) the above results are substituted in 

iSZCCrt- 1^9.96. 

P ftaal 

The necessary results are now available for the ‘‘Analysis of Variance Table.’^ 
THE ANALYSIS OF VARIANCE TABLE 


Source of Variation 

Degreee of Freedom 

Sum of Squares 

Mtan Square 

Replications 

7 

2,289.68 

327.097 

Component (a) 

20 

1,416.27 


Component (b) 

20 

1,389.96 


Blocks {eliminating 

40 

2,805.23 

70.131 

varieties) 

Varieties {ignoring 

35 

15,825.09 


blocks) 

Error 

205 

12,626.92 

61.596 

Total 

287 

33,546.92 



(c) . Teat of aignificance. There frequently will be large differences between 
varieties so that a test of significance may not be needed. If a test is needed, 
one involving only intra-block information may be used. For this purpose, it is 
necessary to calculate the sum of squares for varieties eliminating block effects 
as shown by formula (16): 13,946.28. The mean square will be 399.893, and 

p = 6.49 which is highly significant. 

0I.595 

(d) . Corrected varietal totals and means. The right-hand side of equation (10) 
gives the corrected variety totals, and when divided by eight gives the varietal 
means. These corrected varietal means can then be compared to determine the 
best variety. The corrected varietal totals and means are: 


Corrected Varietal Totals 


1 

2 

3 

4 

5 

6 

743.22 

669.05 

652.00 

705.80 

672.04 

720.32 

7 

8 

9 

10 

11 

12 

747.69 

668.58 

664.57 

751.39 

739.54 

736.96 

13 

14 

16 

16 

17 

18 

642.31 

700.44 

686.41 

713.21 

730.79 

857.26 

19 

20 

21 

22 

23 

24 

666.95 

676.40 

756.25 

801.34 

619.46 

868.84 

25 

26 

27 

28 

29 

30 

704.17 

699.83 

667.71 

814.04 

763.51 

679.44 

31 

32 

33 

34 

35 

36 

. 721.79 

767.48 1 

1 

783.42 

726.05 

780.48 

700.37 
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Corrected Varietal Means 


1 

2 

3 

4 

5 

6 

92.902 

83.631 

81.500 

88.225 

84.005 

90.040 

7 

8 

9 

10 

11 

12 

93.449 

82.322 

83.071 

93.924 

92.442 

91.994 

13 

14 

15 

16 

17 

18 

80.289 

87.555 

85.801 

89.151 

9^.349 

107.168 

19 

20 

21 

22 

T-r ,r , 1 

23 

24 

83.244 

84.425 

94.531 

100.168 

77.432 

108.605 

25 

26 

27 

28 

29 

30 

88.021 

87.479 

70.964 

101.755 

95.439 

84.930 

31 

32 

33 ' 

34 

35 

36 

90.224 

94.685 

97.927 

90.756 

97.560 

96.046 


When comparing one variety with another it is necessary to know the standaixi 
error of the mean difference, in order to judge whether this diflference is signifi¬ 
cant. The formulas for the standard error of the difference bet\^een mean yields 
differ for those sets of varieties which occur together in the same block in groups 
Z and U, in groups Z or U, in groups X or Y, or do not occur together in the 
same block. The formulas foi* these calculations are, respectively: 

(19), (20), (21), and (22). For example, the standard error of the 
difference between two variety means, such as variety 1 and 2, is 

^(61.595) = 4.27. 
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NOTES 

This section is devoted to brief research and expository articles on methodology 
and other short items, 

ON THE EXPECTED VALUES OF TWO STATISTICS 

By H. E. Robbins 
Post Graduate School; Annapolis, Md, 

In a previous paper\ the following theorem was proved. I^et Z be a random, 
Lebesgue measurable subset of Euclidean m dimensional space Efn , and let 
mC-X") be the measure of X, For every point x of let p(x) be the probability 
that X contains x. Then 

(1) £;(m(X)) = f p{x)dix{x). 

In the present note we shall show how this theorem may be used to find the 
expected values of two statistics which arise in sampling theory. Applications 
to similar problems may suggest themselves to the reader. 

I^et F be a real random variable with c. d. f. (cumulative distribution function) 
cF(y), so that for every y, 

(2) Pr (K < y) = 

Jjct Yi, • • • , Ynhcn independent, random variables, each witli the distribution 
of F. Finally, let 

A = min (Fi, • • • , FJ, 

B = max (Fi, • • • , Fn), 

(3) R ^ B - A, 

F = <r(R) -- c{A), 

Although the values of E{F) and E{R) can be found from the sampling distribu¬ 
tions of F and R, and, in fact, are well known, we shall show how to apply (1) 
to find EiP) and E{R) directly. 

To find the first of these, let X denote the set of points in the interval 0 < x < 1 
such that 

(4) o’(i4) < X < a(B), 

Then X is a random set with measure 

(5) m(X) = F. 

Moreover, for any point x the probability that X shall contain x is clearly 

(6) P(x) « 1 - x” - (1 • x)\ 
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Hence by (1), 

(7) Ei^iX)) 
Thus by (5), 

( 8 ) 



— (1 — x)"] dx 


n — 1 
n + 2' 


EiF) 


n — 1 
n + 1 


This result may also be derived by the usual method. In fact, it is not hard 
to show that the c. d. f. of F is 

(9) r(/) = Pr(F</) = (l-n)/” + n/"-‘ for 0</<l, 


whence 


( 10 ) 


E{F) 


= = (1 - n)n f df + n(n - 1) j[V~' df 


(1 — n)w n(n ~ 1) ^ n — 1 
w + 1 n n + 1‘ 


Here the advantage of using (1) is only that it makes unnecessary the calculation 
of the c. d. f. t(J), provided that only E{F) is desired. 

The situation is quite otherwise with E(R), Here the c. d. f. of R is 


/ oo ^o-t*r r ni 

<pia) J ».(6)M ^(0dM"‘*d6dd, 


where ^ is the probability density fimction of Y. Unless ^ is a very simple 
function, it would seem difficult to find a simple expression for the integi-al 


( 12 ) 


E{R) = ^ rdJBif). 


However, if we let X now denote the linear set 

(13) A <t<B, 
then 

(14) /.(X) = B - A R. 

The probability that X shall contain the point t is now 

(15) Pit) = 1 - v"«) - (1 -• <r(0)", 

SO that, by (1) and (14), 

(16) EiR) - /* {1 - ^"(0 - (1 - vCO)"} dt. 

This formula for the expected value of the range in a sample of n from a popular 
tion Y with c. d. f. <r(t) is believed to be new. 
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If «r(t) is such that can be found Rs an explicit function <A a, thon (16) 
can be written with advanta^ as 

(17) E{R) - {1 - <r» (1 - a)"] ^da. 


For example, suppose the random variable Y has the probability density function 


(18) 


<p(y) 


(1 + «>')*’ 


and hence the c. d. f. 

(19) <r(y) ■= 
Then 

(20) f = log , 

1 


l + e»' 

M ^ 1 

da <r(l — a)' 


Hence from (17), the expected value of the range in a sample of n is 

( 21 ) 

The indicated division in the integrand may be carried out, and the result, a 
polynomial in a of degree <(n, — 2), when integrated between 0 and 1, gives 
an explicit formula for E{R). Thus for samples of n = 2, 3, 4 we find the values 
of E{R) to be respectively 2, 3, 11/3. Incidentally, it is always true that the 
expected value of the range for n — 3 is three-halves that for n = 2. This 
follows from (16) and the algebraic identity 

(22) {1 - - (1 cr)*} = ill - - (1 - cy)^j. 


REFERENCES 
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ON RELATIVE ERRORS IN SYSTEMS OF LINEAR EQUATIONS 

By a. T. Lonseth 
Northwestern University 

Some time ago in these AnnaZs\ L. B. Tuckerman discussed the effect of rela¬ 
tive coeflHicient errors on relative solution errors for a non-singular linear algebraic 
system; his discussion was confined to errors so small that their squares and higher 
powers can be neglected. Dr. Tuckerman's paper was principally concerned 


^ L. B. Tuckerman, “On the mathematically significant figures in the solution of simul¬ 
taneous linear equations,“ Annals of Math. StaL^ Vol. 12 (1941), pp. 307-316. 
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with the important problem of limiting errors incurred while solving the system, 
and has suggested the desirability of a non-infinitesimal treatment of relative 
errors. Such a treatment follows; the method is a variant of that used in a 
paper on absolute errors®. The computations provide (1) a criterion for allow¬ 
able relative errors in the coefficients to assure non-vanishing of the deter¬ 
minant; (2) a bound (subject to this criterion) for the relative error in each solu¬ 
tion; (3) a specification of accuracy in the coefficients to assure a pre-assigned 
accuracy in the solution. 

We consider a system of linear equations 

n 

(1) £ onXj = Ci * =« 1, 2, ■ • • , n, 

where none of the following vanish: the n(n + 1) coefficients a,y, c, ; the deter¬ 
minant A of the system; and the n solution-components Xi . Under these condi¬ 
tions it is possible to speak of ‘^relative errors^’ in the a% c% and x% Let 
€ij , <r, be the relative errors in an and c* respectively, so that aij is perturbed to 
ao(l + €ii); Ci to Ct(l + <r»). We inquire as to limitations on and <r,* which 
will permit solution for Xj(l -f- py) of the system 

(2) X) + €<y)^y(l + Pi) = Ci(l + (Ti) i « 1, 2, • • • , n, 

/-I 

where pj is the relative error induced in xj ; and we seek to limit | py | in terms 
of the c's and it's. We shall assume that for all i, j 

(3) |€yy|,|iry| <a, 

where 5 will be suitably restricted later. 

Combining (1) and (2) we get 

n n n 

(4) X o,ijXjPj = CiCTi — X — X o^ijUiXiPj i = 1, 2, • • • , n, 

,-i y-i y-i 

Since by hypothesis the determinant A of (1) is not zero, matrix A = {an) has 
the inverse = (6yy) = (Ay^/A), where A<y is the cofactor of a<y in A. Mul¬ 
tiplying (4) by hki and summing on i we get 

n n n 

(6) aj*p* ■* X — X 

«*-*l <-l imml 

n n 

X (ki^jXjpj fc as 1, 2, • • • , n, 

i>ml }.l 

and by (3) 

(6) P ^ B{Mk + Nk\ Pk 1)> 

* A. T. Lonseth, **Systems of linear equations with coefficients subject to error,” AnnaU 
of Math. Stat., Vol. 13 (1942), pp. 382-337* 
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where p 18 the greatest I pt I, and 

(7) S ^ 

(8) Nk » .-^i S 2 I ikiOtiXi I. 

\Xk\i~t i-t 

80 that 

Mk *» iV* + r-^ 2 I ^UCi I. 

\Xk\ M 

Denote by M, N the maximum values of Mk, Nk respectively over A;» 1,2, • • •, 
n. From (6), 

P ^ a(itf + Np), 

whence, if 

(9) a < I/N, 
it follows that 

(10) p ^ SM/a - SN), 

which of course bounds each individual | pt |, though rather crudely. To 
bound I pk I more genuinely in terms of S, Mk, Nk, M and N it remains only to 
use this inequality with (6): 

(11) I p* I ^ SiMk + SMNk/(l — SN)), k = 1, 2, • • • , n; 

with Mk, Nk as given in (7) and (8). 

If (9) holds, then, it follows that | pk | is bounded by (11)—if pk exists. This 
essential point can be established by solving (5) for pk by iteration^; (9) is a 
sufficient condition for convergence of the resulting series, and hence for non¬ 
singularity of the perturbed matrix (o,y + €i>o</). 

In order to be sure that | pk | ^ i), a pre-assigned number, for all k, it suffices 
by (10) to choose a so that 

aM/(l - SN) g n, 

whence 

a ^ v/{M -H Nv). 

A less simple inequality whose satisfaction by a will guarantee that | pt | ^ 
follows from (11), namely 

a g (A - B)/2C, 

where A = {(M* - Nip.f + iMNkVk]*, B = Mk +Nnk,C = \MNk-NMk\. 
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A RECIPROCITY PRINCIPLE FOR THE NEITMAN-PEARSON THEORY 
OF TESTING STATISTICAL HYPOTHESES 

By Louis M. Court 

Iq contrasting the tested hypothesis Hi with the alternative Ht, i^., in 
comparing the probability distribution p(xi, • ■ ■ , Xn | Hi) associated with the 
first hypothesis with the distribution p(xi, • • • , x„ J Ht) associated with the 
second, Neyman and Pearson select the best critical region R* from the infinite 
set of critical r^ons R of & specified size a by minimizing the probability 

(1) / p{Xi , • • • , Xn I //*) fltel • • • dXn , 

Js-R 

of accepting Hi when Ht is true (Type II Errors) subject to the constancy of the 
probability of rejecting Hi when Hi is correct (Type I Errors), 

(2) / p(xi, • ■ ■ , x„ I Hi) dxi dxn = a. 

S in (1) represents the whole of variate (xi, • • • , Xn) space and S — R, the com¬ 
plement of R relative to S. 

Obviously (1) is conditionally minimized when 

(3) / p{xi , • • ‘ , Xn 1 F 2 ) ctci • • dXn 

is maximized subject to (2), Neyman and Pearson have shown that if one or 
more members of the one parameter (X) family of regions R*(\) defined by the 
inequalities 

(4) p(Xi , • • • , Xn 1 fr2) ^ Xp(Xi , • • • , Xn I /fl) 

satisfy the ‘^side^^ condition (2), they will be best critical regions maximizing (3) 
subject to (2) or minimizing (1) subject to (2)\ As suggested by the notation, 
the family R*(\) depends upon X and, if sufficient restrictions are imposed upon 
p(xi, • • • , Xn I Hi) and p(xi, • • • , Xn | Hz), there is one and only one region for 
every value of X lying in an interval contained in the positive half-axis. X, 
itself, is clearly a function X(a) of a. Consequently, R*(\) depends upon a 
and may be written as R*[a]. The best critical region for a preassigned size 
a is given by R*[a]. 

Will we get the same best critical region if among the regions T that fix the 
probability of Type II Errors, 

(5) / p(Xi , • • • , Xn 1 i/ 2 ) dXl * • • dXn = 1 - jS, 

JS^T 


* For a full exposition, see Neyman and Pearson, Slat, Rea, Memoirs, Vol. 1 (1936). 
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we find the one that nunhnizes the int^al in (2) with R replaced by T, i.e. 
if we find the one that minimizes the probability of Type I Errors? We ahaU 
call this turnabout (rf the usual process the reversed Neyman-Pearsop principle. 

To discover the answer, we note that / p(®i ,•••,*« I d*i ••• dx, is equal 

ia 

to unity and (5) may be rewritten as 

^ p(^i j * * * , 1 dxi • • • dxn — d* 

The regions that minimize the left side of (2) with R replaced by T subject to 

(6) are obviously identical with those that maximize the negative of this left 
side subject to (6) multiplied through by —1. The latter problem is formally 
identical with the one referred to in the second paragraph of this note and, in¬ 
voking Neyman and Pearson's result, we conclude that the said conditional 
maximization is effected by the one parameter 0*) family of regions T*(jr^) 
defined by the inequalities 

(7) -p(xi , ••• ,x„\Hi) ^ -np(xi , • • •, x„ I jfifs) 


or 


p(xi , • • • , x„ I i/s) ^ - p(xi, 


,Xn\m). 


/a”* in denotes the reciprocal of # 1 . It is clear from (4) and (7) that the 

families of regions R*{\) and satisfying the direct and reversed Neyman- 

Pearson processes, coincide. 

As before, n is some function of /3. Hence, ;8 is a function of m and. 


accordingly, a function 



of a. 


Consequently, if the level at which the 


probability of Type II Errors in the reversed Neyman-Pearson process is held 

constant is taken equal to 1 — |8 in terms of the level a at which the 

LX(a)J 

probability of Type I Errors is fixed in the direct Neyman-Pearson method, the 
reversed and direct processes yield the same best critical region. This is the 
reciprocity principle cdluded to in the title of this note in Us fvU ampleteness. 


’ will generally be distinct in form from a(X), although the second line in (7) coin¬ 
cides upon the substitution of X for it~^ with (4), since the integrand in (5) is p(xi, , 

Xn I Ht) whereas that in (2), regarded as a constraint in the direct process, is p(xt, ••• , 
Zn I Hi). 
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AN INEQUALITY DUE TO H, HORNICH 


By Z. W. Birnbaum and Herbert S. Zuckerman 


University of Washington 

H, Hornich^ proved a theorem on the average risk of the sum of equal insurance 
policies. It seems of interest to note that when translated from its actuarial 
formulation into the terminology of the calculus of probabilities this theorem 
becomes an inequality for mean deviations of random variables, and to present* 
it with a concise proof in non-actuarial language. 

Let a; be a random variable with a symmetrical probability distribution, Di = 
£( I X 1) its mean deviation, xi, xj, • • • , independent repetitions of x, and 
Dn E{\xi + Xf + • • • + Xn\) the mean deviation of Xi + xj + • • * + Xn. 
Then Dn fulfills the inequality 


( 1 ) 



^here 


"n" 


denotes the greatest integer g -. 


If the distribution of x is not 


symmetrical but E(x) = 0, the inequality becomes 


( 2 ) 



The proof will be given for a continuous random variable but it clearly holds 
quite generally. If /(x) is the probability density of x, and E{x) = 0, then one 
has 


(3) 


Di 


£ -Ho 

I X I /(x) dx = 

to 



dx. 


In the expression for Dn , the integration over the entire n-space (xi, X 2 , •, Xn) 
may be performed by integrating separately over each of the 2^* “octants'^ which 
correspond to the different combinations of signs of the coordinates, and thus one 
obtains, for a symmetrical distribution, the estimates 



'00 O""! 


£ 


f ej—dbl, •' *1 


/ / ■ / 

•gnar2—02 ogn*,,— 



//•■•/ 


l(n^/2J 

^2 E 


•gii«l»ogn»2'~’ * —■■*n**—l 


// ■/ 


»gn«i—•gnss«>* • 

•gnx, + i—* • —ignxn—+l 


^ Hans Hornich, “Zur theorie des Bisikos,” MonaUh. Math. Phys., Vol. 60 (1941), pp. 
142-160. 
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If X is not symmetrical but E(x) = 0, we consider the random variable x' 
with the probability density g(x') — fi—x')y and the random variable y =» 
X 4- x\ In view of (3) we find 


f 1X + a:;' I f(x)g(x') dxdx' ^ 2 f f (x + x')f{x)g{x') dx dx' 
« J-oo Jq 

- 2 J g(x') xf(x) dxdx' + 2 f(x) x^g{x') dx* dx 

= E{x) g{-x) dx + fix) dx| = Eix). 

Let Xi , X 2 , • * * , x„ and xj, xj, • * * , Xn be independent repetitions of x and x', 
respectively, and yt = x< + x(. Since y has a symmetrical distribution, an 
application of (1) gives 

Ei\ Xi + X2 + • • • Xn I) = i{®(l Xl + • • • + Xn I) + S(| xj + • • • + Xn |) | 

^ m\Xl + x[ + ••• +Xn+x;|) - J£(|yi+ ••• +yn\) 

^ l E(\y |)n /”r'r-|^\ ^ -®(l * |)n 

vBJ/’ ='" 

An application of Stirling’s formula shows that the right hand sides in (1) and 
(2) are the order of magnitiKle of y/n. 
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NOTE ON A LEMMA 


By Abraham Wald 
Columbia University 

In a pi’evious paper on the power function of the analysis of variance test^ 
the author stated the following lemma (designated there as Lemma 2): 

Lemma 2. Let vi, • • • ,Vk be k normally and independently distributed variates 
mih a common variance a. Denote the mean value of Vi by ai (i = 1, • • • , fc) 
and let f{vi , • • • , , e) he a function the variables vi, • • • j Vk and cr which 

does not involve the mean values ai, * * • , at. Then, if the expected value of 
f(vi, • • • , Vk, <t) is equal to zero, f(vi , • • • , wa , <r) is identically equal to zero, 
except perhaps on a set of measure zero. 

In the paper mentioned above it was intended to state this lemma for bounded 
functions f(vi, • • • , Vk) and the lemma was used there only in a case where 
f(vi ,••• ,Vk) is bounded. Through an oversight this restriction on f(vi, • — , Vk) 
was not stated explicitly.^ The published proof of Lemma 2 is adequate if 
f(v \, • • • , Va:) is assumed to be bounded. From the fact that the moments of a 
multivariate normal distribution determine uniquely the distribution it is 
concluded there that if for any set (n , • • • , rk) of non-negative integers 

( 1 ) f f »!*••• dv, dVk = 0 

J—00 J—oo 

identically in the parameters ai , • • • , a/t then f{vi, • • • , Vk) must be equal to 
zero except perhaps on a set of measure zero. This conclusion is obvious if 
f(vi ,Vk) is bounded. In fact, from (1) and the boundedness of/(t;i , • ‘ , Vk) 
it follows that there exists a finite value A such that 

‘ , • • •. «*) = [i - i/(»->’••• > 

is a probability density function with moments equal to those of the normal 
distribution 


Hvi, 


, Vk) 




Hence f(vi, • • • ,Vk) must be equal to zero except perhaps on a set of measure' 
zero. However, this conclusion is not so immediate if no restriction is imposed 
on f(vi, • • • ,Vk) except that 

( 2 ) ’ L ‘ ‘ » ‘'a) I dvi dvk < 


for all values of the parameters ai, • • • , a*,. It is the purpose of this note to 
prove this, In other words, we shall prove the following proposition: 


^ A. Wald, the power function of the analysis of variance test,” Annals of McUh. 
Stat., Vol. 13 (1942), pp. 434. 

* 1 wish to thank Prof. J. Neyman for calling my attention to this omission. 
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PROPOsmoN I. If (2) holds for aU values of the parameiers ^ , a* oHd 
if for any (n, • • • , r*) of nonrnegative integers eguation (1) holds identically 
in ai, •••, ah, then f(pi, • •• ,Vk) must be equal to zero except perhaps on a set 
of measure zero. 

On the basis of Proposition I and the arguments given on p. 438 of the paper 
mentioned before, it can be seen that restriction (2) on the function f{vi, * * * , vji) 
is sufficient for the validity of Lemma 2. 

To prove Proposition I, we shall first show that the following lemma holds. 
Lemma A. If h(vi ,••• ,Vk) is a probability density function and if. 

(3) I 'I feC®! .•••>»*)« ^ dvi ■ ■ • dOi < CO 

JL^ce «L>oo 


for some S > 0, then the problem of moments is determined for the moments of the 
distribution h{vi , • • ■ , v*). 

This lemma was proved by G. H. Hardy for fc — 1.® I shall prove it for 
A; > 1. Since 


r^(L\vi\r 


~(2n)t 


< € 


,tZI.il 


(4) 'L 
we obtain from (3) 


dvfc < 00. 


Hence 


( 6 ) 



|.+«0 


(t.?”)] 

f 

f— 
8 

y _ 


L 

_ n —0 

(2n)! J 


dui •• • dvk < ». 


Denote the 2nth moment of j;,- by jua» • Because of (3) the moments are 

k 

finite. Furthermore, denote ^ ai 2 » by Xsn . Then we obtain from (6) 


(7) 

From (7) it follows that 

( 8 ) 

Hence 

(9) 


5*"Xsn 
» (2n)! 


< 


00 , 


lim sup 

n—» 


(2n)! 


< 1 . 




< 1. 


* See for instance, Shohat and Tamarkin, *‘The problem of moments,’^ Hath. Surveys 
No. 1, Amer. Math. Soe., New York, 1943, p. 20. 
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Since according to Stirling’s formula 

lim (2n)l/(2n)*"c~*"'\/i«i = 1 


2nr-‘ 


< 1 . 


we obtain from (9) 

(10) lim sup 

n«-« 

Taking reciprocals we obtain 

(11) liminf?^^^^> 1 

nmrnm to 


lim inf > ^ > 0. 


or 

(12) ***** **** o 

But (12) implies the existence of a positive value p so that 


(13) ^ (n = 1,2, • • •, ad inf.) 

71 

From (13) it follows that 

(14) 

flail 

But (14) is Carleman’s sufficient condition for the determinateness of the prob¬ 
lem of moments. Hence Lemma A is proved. 

On the basis of Lemma A we can prove Proposition I as follows: From (2) 
we obtain 

(15) \f{vx, ■■■ ,v,)\ dvi--- dv, < =c 


for all values ai, • • • , ait. Let fi(v) == f{v) for all points v = (vi , • • • , Vk) 
for which f{v) > 0, and/i(i;) = 0 for all points v for which /(v) < 0. Similarly, 
let flip) = —/(v) for all points v for which f(v) < 0, and f^iv) = 0 for all points 
V for which f(v) > 0. Then fi(v) and f^iv) are non-negative functions and 

(16) f{v) = /i(») - ft(v}. 

From (15) it follows that 

(17) 1 ^ (fi>i • • • da* < « 

*-•0 J —• 

and 

£^/*(a)e“**’5+*“‘’* dvf - dvk< «, 


(18) 
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Let 

(19) /?(!»)=O'-1, 2). 
Now we shall show that for any positive values ft, • • • , |8* 

(20) ' L » ■ ’ ■ * d»i dvi, < CO. 

In fact, consider the 2* sets (oi, • • • , o*) where o< = ±1 (i == !,•••, k). 
Denote by j2ai -at the subset of the fc-dimensional Cartesian space which cob- 
sists of all points v — (vi, ■ , Vk) tor which Vi is either zero or signiun Vi * 

signum at {i — 1, ■ ■ ■ , k). Putting a,- = OiPi , it follows from (17) and (18) 
that 

(21) f fdvx , • • • , - dp, ... d,;* < 00. 

Since (21) holds for any of the 2* sets Ko,...,,, equation (20) is proved. 

From (1) it follows that 

f pI‘••• p?[/!‘(Pi, •••,»*)-/s(Pi, ••• ,P*)]dPi •••<&*« 0, 

ol—ao X-oo 

for all non-negative integers n, • • • , r* . Hence, because of (21) and Lemma A 
we see that 

(22) fiivi, ••• , Vh) - ftivi, , Vk), 

except perhaps on a set of measure zero. From (22) it follows that 
f{vi ,Vk) = fi{vi, ,Vk) - Mvi , ,Vk) = 0, ‘ 
except perhaps on a set of measure zero. Hence Proposition I is proved. 

A NOTE ON SKEWNESS AND KURTOSIS 

By .1. Ernest Wilkins, Jr. 

UniversUy of Chicago 

It is the purpose of §1 of this paper to prove the following inequality: 

I 

(1) 04 ^ 0(3 + 1. 

This inequality seems to have first been stated by Pearson\ The inequality 
also follows from a result appearing in the thesis of Vatnsdal. Here we give a 
proof based on the theory of quadratic forms which seems to be more direct 
and more elementary than either of the previous proofs. 

^ Mathematical contributions to the theory of evolution, XIX; second supplement to 
a memoir on skew variation,’’ Phil, Trans, Roy, 8oc, (A), Vol. 216 (1916), p. 432. 
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The inequality ( 1 ) obviously shows that 04 ^ 1. It is then natural to ask for 
an upper bound for 04 . In §2 we shall show that there is no universal upper 
bound (independent of the number N of quantities in the distribution) for as. 
In fact we ^d the actual dependence of the maximum possible value of os as 
a function of N. The form of this function seems to be known but not to have 
been rigorously proved before. It then follows from ( 1 ) that there is no uni¬ 
versal upper bound for cu . 

1, The inequality ( 1 ). Let us consider the quadratic form 
(?(a, 6 , c) == + 2^106 + 2 v 2 ac + V 2 b^ + 2 i^c + 

= i\r^S(a + xb + 

It follows that G{a, ft, c) is a positive semi-definite quadratic form. In fact, 
if there are at least three distinct values of x, then 6(a, ft, c) is a positive definite 
form. Consequently, its discriminant 


VO 

Pi 

Pi 

Pi 

Pi 

Pz 

Pi 

Pi 

PA 


must be non-negative. There is no loss of generality in supposing that Pi = 0, 
s: 1, in which case we find that 

1 0 1 

0 1 08 0 . 

1 08 04 

Expanding the determinant, we get the inequality (1). 

We remark that equality holds in (1) if and only if there are only two distinct 
values of x, 

2. The maximum value of 0 $. It is clear that this maximum will be N'~^ 
times the maximum value of the function f{x) == Sx* on the bounded closed set 
consisting of those points x for which g{x) = Xx^ = N and h{x) = 2x = 0. 
According to the Lagrange multiplier rule, this latter maximum is obtained as 
follows. Let F{x) = f{x) — — /ife(x). Then the maximiaang point 

satisfies the relations 

F*,. = 3x? - 2Xx. - M = 0, Sx^ = N, Xx = 0. 

The equations SF, = 0, SxF* = 0 shows that /x = 3, /«« = 2N\/3, Solving 
the equation = 0 gives 

(2) X.- = [X + e<(X* + 9)*]/3, 

where e< = ±1. For these values of x< we shall have h{x) = 0, gix) == N ii 
and only if 


X = -(X* + 9)*iV"*2c. 
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Therefore X has the sign opposite to that of Se, ahd 

X*IiV* - (Sc)*l = 9(Se)’. • 

It follows that Se ±jV, and that 

(3) X-3Se/[JV^ - {Se)’]‘, 

/n..x = -2ArSe/[iV“ - (Sc)*]*. , 

We have still not obtained the maximum, however, since the minim um will 
also satisfy all of the relations deduced above. We distinguish the maximum 
from the other critical values by examining the function 

tf(S) = -2Nl/(N^ - S*)*. 

Since Sc ±N, e< = dbl, it is clear that iV — 2^Sc^2 — iV. We therefore 
consider S(S) on the interval (2 — N, N — 2). We find that 

de/dS = - 2 iVV(W* - S*)*'* < 0 , 

so that ^ is a decreasing function of S on the interval indicated. Its maxiinum 
value will therefore occur when 1 = 2 — N, and this maximum value wiU be 

6(2 -N) = N(N - 2)/(N - 1)*. 

I'he value Se = 2 — iV occurs only when one of the e<, say Ci, is equal to +1 
and all the rest are equal to — 1. Then we find from (3) and (2) that 

X = 3(iV' - 2)/2(W - l)i, 

(4) xi = (N - 1)*, xt = xs - Xn = -{N - 1)“*, 
a, = fix)/M = (AT - 2)/(i\r - D*. 

Since the maximum value of a» given by (4) approaches « with N, it follows 
that there is no universal upper bound for a*. More precisely, the quantity 
Os can be made as large as desired by choosing N large enough and then picking 
Xi as in the last paragraph. Since there is no universal upper bound for o^, 
it is clear from (1) that there is no universal upper bound for 04 . It would 
probably be possible, although rather difficult, to derive an explicit bound for 
ai as a function of N by using the methods employed above for 03 . 



NEWS AND NOTICES 

Readers are invited to submit to the Secretary of the Institute news items of general interest 

Personal Items 

Mr. Carl A. Bennett is doing war research on a project at the University of 
Chicago. 

Dr. George W. Brown, formerly Research Associate at Princeton University, 
is now at the RCA Laboratories in Princeton, New Jersey. 

Professor Harry Carver is on leave of absence from the University of Michigan 
to do Operations Analysis work with the Army Air Forces. 

Mr. George B. Dantzig is now Principal Statistician of the Army Air Forces 
Statistical Control Division at the Pentagon Building in North Arlington, Vir¬ 
ginia. 

Assistant Professor Preston C. Hammer of Oregon State College has taken a 
leave of absence to aid in setting up statistical methods of quality control for the 
Lockheed Aircraft Corporation of Burbank, California. 

Dr. Tjalling Koopmans is now associated with the Cowles Commission for Re¬ 
search in Economics at the University of Chicago. 

Dr. Jerome C. R. Li is now an instructor in mathematics at Queens College in 
Flushing, New York. 

Associate Professor Joe Livers of Montana State College has been granted a 
leave of absence for the summer and fall terms to study at the University of 
Michigan. 

Assistant Professor Eugene Lukacs of Illinois College, Jacksonville, Illinois, 
has accepted an associate professorship in mathematics at Berea College, Berea, 
Kentucky. 

Mr, R. I. Piper, who has been on leave from the Southern California Telephone 
Company to do war research at the California Institute of Technology, has re¬ 
turned to the Telephone Company as Plant Staff Assistant. 

Dr. Henry 8cheff6, formerly lecturer in mathematics at Princeton University, 
has been appointed to an assistant professorship in mathematics at Syracuse 
University. 

Assistant Professor H. M. Schwartz of the University of Idaho has been ap¬ 
pointed research fellow at the Bartol Research Foundation in Swarthmore, Penn¬ 
sylvania. 

New Members 

The following persons have been elected to membership in the Institute ; 

Bower, O. K, Ph.D. (Illinois) Asso. in Dept, of Math., Univ. of Illinois. 505 W. John, 
Champaign, Ill. 

Breden, Robert E. B.S. (Kansas State College) Personnel Technician, Statistical Serv¬ 
ices, Technical Section of the War Dept., 270 Madison Ave., New York, N. Y. 
Brizey, John C. Ph.D. (Chicago) Asso. Prof, of Math., Univ. of Oklahoma. 927 S. Pick¬ 
ard, Norman, Okla. 

Brown, Arthur B. Ph.D. (Harvard) Asst. Prof, of Math., Queens College, Flushing, N.Y. 
Bruyere, Martha (Mrs. Paul T.) Stat., U. S. Public Health Service, Div. of Venereal 
Diseases, Gaithersburg, Md. 
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Bnsyere. IHittl T. M.P.H, (Yale) Stat., U. S. Public Health Service, Bldg. T6, Betheedaf 
Md. 

Carter, Gerald C, Ph.D. (Purdue) Dept. Head, Naval Training School, Purdue XTniv. 
530 Garfield St., W. Lafayette, Ind. 

Casanova, Teobaldo. Ph.D. (New York) Res. Stat., Univ. of Puerto Rico, Rio Piedras, 
P. R. 

Chances, Ralph. B.B.S. (C.C.N.Y.) Stat., Industrial Surveys Co., 847 Madison Ave., 
New York, N. Y. 

Cody, Donald D. A.B. (Harvard) Res. Math., Res. Lab., Indianapolis Naval Ordnance 
Plant, Indianapolis, Ind. 

Duvall, George £• Asst. Physicist, UCDWR, XJ. S. Navy Radio and Sound Lab., San 
Diego 62, Calif. 

‘Ellis, Wade. Ph.D. (Michigan) Special Instr. in Math., Univ. of Michigan. 921 Wood- 
lawn, Ann Arbor, Mich. 

Field, Robert W. Ph.D. (Illinois) Asao. Prof, of Industrial Engineering, Purdue Umv., 
Lafayette, Ind. 

File, Quentin W. Ph.D. (Purdue) Instr. of Elec. Wiring, Purdue Naval Training School. 
Physics Bldg., Purdue Univ., Lafayette, Ind. 

Freeman, Albert M. Dir. Math. Lab., Boston Fiduciary Sc Research Associates. Neck 
Rd., Tiverton, R. I. 

Gerloui^, Daniel L. B.S. (Calif. Inst, of Tech.) Quality Control Engineer, Plomb Tool 
Co., Box 3519, Terminal Annex, Los Angeles 64, Calif. 

Germond, Hallett H. Ph.D. (Wisconsin) Asso. Prof, of Math., Univ. of Florida, Ciaines- 
ville, Fla. (On leave). 

Ghormley, Glen E. Stat. Analyst, Lockheed Aircraft Corp. 139 N. Chester Ave., Pasa¬ 
dena 4, Calif. 

Grant, Eugene L. A.M. (Columbia) Prof, of Economics of Engineering, Stanford Univ., 
CaUf. 

Gunlogson, L. S. B.B.A. (Minnesota) Ensign, USNR. 3616 18th Ave., So., Minneapolis 
7, Minn. ■* ' 

Hart, Alex L. Ph.D. (Minnesota) Asst, to the Dir., Res. Dept., Eastern Air Lines, Inc. 
141-24 79 Ave., Flushing, L, I., N. Y. 

Kefferstan, William F. Mgr., Economic Research, Boston Fiduciary Sc Research Associates, 
60 Congress St., Boston, Mass. 

Lyons, Will. B.Sc. (Bucknell) Economist, Gen. Statistics Staff, W.P.B. 2027 Park Rd., 
N.W., Washington 10, D. C. 

McBee, Ethelyne L. M.A. (Columbia) Stat., Dept, of Agric. 2126 N. Stafford St., 
Arlington, Va. ^ 

McIntyre, Donald P. M.A. (Toronto) Meteorologist in Charge, Prince Geoige Aiiport. 
Box 296, Prince Geoige, B, C., Canada. 

Moss, Judith. B.A. (Vassar) Jr. Math., Stat. Res. Group, Div. of War Research, Columbia 
Univ. 319 St. John’s PI., Brooklyn 17, N. Y. 

Oosterhof, Willis M, M.A, (Michigan) Stat., Mich. State Dept, of Social Welfare. 
811 Hackett St., Ionia, Mich. 

Piper, Robert I. B.A. (Montana) Res. Asso. Room 204, Astrophysics Bldg., Calif. Inst, 
of Tech., Pasadena 4, Calif. 

Robbins, Herbert £. Ph.D. (Harvard) Lt., USNR. Post Graduate School, Annapolis, 
Md. 

Seeley, Sherwood B. Ch.E. (New York) Dir., Res. and Tech. Div., Joseph Dixon Cru¬ 
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REPORT ON THE WELLESLEY MEETING OF THE INSTITUTE 

The Seventh Summer Meeting of the Institute of Mathematical Statistics was 
held at Wellesley College, Wellesley, Mass., on Saturday and Sunday, August 12 
and 13,1944, in conjunction with the meetings of the Mathematical Association 
of America and the American Mathematical Society. The following 51 members 
of the Institute attended the meeting: 

T. W, Anderson, H. E. Arnold, K. J. Arnold, L. A. Aroian, A. L. Bailey, J. L. Barnes, 
C. I. Bliss, A. H. Bowker, B. H. Camp, C. W. Churchman, W. G. Cochran, T. E. Cope, 
J. H. Curtiss, W. E. Deming, P. S. Dwyer, Will Feller, C. D. Ferris, R. M. Foster, 
H. A. Freeman, Henry Goldberg, E. J. Gumbel, P. R. Halmos, Harold Hotelling, Truman 
Kelley, L. R. Klein, Myra Levine, John Mandel, J. W. Mauchly, Richard v. Mises, 
E. B. Mode, Vaclav Myslivec, P. M. Neurath, M. L. Norden, C. O. Oakley, P. S. 01m- 
stead, Edward Paulson, Wm. Reitz, S. L. Robinson, F. E. Satterthwaite, Henry Scheff6, 
W. A. Shewhart, Andrew Sobszyk, H. W. Steinhaus, Marian Torrey, Mary Torrey, A. W. 
Tucker, J. W. Tukey, Abraham Wald, R. M. Walter, Elizabeth Wilson, Jacob Wolfowitz. 

The first session was held jointly with the Mathematical Association and con¬ 
sisted of a Symposium on ^Totential Opportunities for Statisticians and the 
Teaching of Statistics.^’ The President of the Institute, Dr. W. A. Shewhart, 
presided. The principal addresses were made by Dr. Shewhart and Professor 
Harold Hotelling. Remarks were also made, upon invitation of the Chairman, 
by Prof. Milton de Silva Rodrigues of Sao Paulo University in Brazil who is 
spending a year in studying methods of teaching statistics in this country, and by 
Dr. Vaclav Myslivec, Czechoslovak Delegate to the United Nations Interim 
Commission on Food and Agriculture. A lively discussion was imder way when 
time forced the conclusion of the meeting. There was continued discussion by a 
smaller group for some time afterwards. 

Professor B. H. Camp acted as Chairman at the Sunday morning session, a 
contributed papers session held jointly with the Association. The following 
papers were presented: 

1. Statistical Tests Based on Permutations of the Observations. 

A. Wald and J. Wolfowitz, Columbia University. 

2. Error Control in Matrix Calculation, 

F. E. Satterthwaite, Aetna Life Insurance Co. 

3. On Cumulative Sums of Random Variables, 

A. Wald, Columbia University. 

4. The Approximate Distribution of the Mean and of the Variance of Independent Vari¬ 
ates. 

P. L. Hsu, National University of Peking. (Introduced, and presented, by W. Feller, 
Brown University). 

5. Ranges and Midranges. 

E. J. Gumbel, New School for Social Research. 

6. Statistics of Sensitivity Data, II. Preliminary report. , 

C. W. Churchman, Frankford Arsenal and Benjamin Epstein, Westinghouse Electric 
and Manufacturing Co. 
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Fresideat Sbewhart presided at the Sunday aftonoon W8«(m. The 
ing invited addresses‘Were given: , 

■ 1. The PrcAlem on ToUranee Limita. 

Lt. J. H. Curtiss, USNR. ^ 

Some Improveniente in Weighing and Other Experimental Techniquee. 

H. Hotelling, Columbia University. 

A business meeting was held at the conclusion of the Sunday afternoon sessimi. 
The Secretary-Treasurer made a brief report dealing with (1) the financial condi¬ 
tion of the Institute and (2) the membership growth of the Institute. The Pres¬ 
ident, reporting for the Editor, indicated a need for more papers for die next 
two or three issues of the Annals. The Institute, after some discussion, then 
passed two Amendments to the Constitution and four Amendments to the By- 
Laws. These Amendments are listed in the following section. A resolution 
thanking the officials of Wellesley College was passed. 

A dinner for the three mathematical organizations was held Sunday evening. 
Addresses were made by Captain Mildred H. McAfee and Professor Marshall H. 
Stone. Later in the evening there was a musicals featuring David Barnett. 

P. S. Dwtbb 
Secretary 



AMENDMENTS TO THE CONSTITUTION AND BY-LAWS 
OF THE INSTITUTE 

The following Amendments to the Constitution and By-laws were passed at 
the business meeting at Wellesley College on August 13,1944. The votes pf all 
voting members who sent ballots to the Secretary-Treasurer prior to the time of 
the meeting were counted in the balloting. The Amendments as adopted are 
identical with the proposed Amendments which were placed in the hands of the 
membership in July: 


Amendments to Constitution 

1. Article III. 3. The hrst sentence, which was 

^The Institute shall have a Committee on Membership composed of three Fellows. 

shall be revised to read: 

*The Institute shall have a Committee on Membership composed of a Chairman and 
three Fellows.” 

2. Article IV. 3. The first two sentences, which were: 

”The Committee on Membership shall hold a meeting immediately after the annual 
meeting of the Institute. Further meetings of the Committee may be held from time to 
time at the call of the Chairman or any member of the Committee provided notice of such 
call and the purpose of the meeting is given to the members of the Committee by the 
Secretary-Treasurer at least five days before the date set therefor.” 

shall be revised to read: 

'^Meetings of the Committee on Membership may be held from time to time at the call 
of the Chairman or any member of the Committee provided notice of such call and the 
purpose of the meeting is given to the members of the Committee by the Secretary- 
Treasurer at least five days before the date set therefor. Committee business may also 
be transacted by correspondence if that seems preferable. 

Amendments to By-Laws 

1. Article I. 4. Add the following sentence: 

”The power of election to the different grades of Membership, except the grades of 
Member and Junior Member, shall reside in the Board.” 

2. Article I. 5. which was: 

**The Committee on Membership shall prepare and make available through the Secre¬ 
tary-Treasurer an announcement indicating the qualifications requisite for the different 
grades of membership.” 

shall have added the following sentences: 

*^The Committee shall review these qualifications periodically and shall make such 
changes in these qualifications and make such recommendations with reference to the 
number of grades of membership as it deems advisable. The power to elect worthy 
applicants to the grades of Member and Junior Member shall reside in the Committee, 
which may delegate this power to the Secretary-Treasurer, subject to such reservations 
as the Committee considers appropriate. The Committee shall make recommendations 
to the Board of Directors with reference to placing members in other grades of member¬ 
ship. The Committee shall give its attention to the question of increasing the number 
of applicants for membership and shall advise the Secretary-Treasurer on plans for that 
purpose.*! 
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3. After Article XL 1 (ft) Sxceptloiu Add: 

**(b) Exeeption, Any Member or Fellow nuiy mftke ft single payment whieh will be 
accepted by the Institute in place oi all succeeding yeariy dues a^ whidi will not other¬ 
wise alter his status as a Member or Fellow. The amount of this payment will depend 
upon the age of this Member or Fellow and will be based uik>n a suitable mortality t^de 
and rate of interest, to be f^cified by the Board of Directors/* 

and 

4. **(c) Exception. Any Member or Junior Member of the Institute serving, except as 
a commissioned officer, in the Armed Forces of the United States or of one of its allies, 
may upon notification to the Secretary-Treasurer be excused from the payment of dues 
until the January first following his discharge from the Service. He shall have aU privi¬ 
leges of membership except that he shall not receive the Official Journal. However during 
the first year of his resumed regular membership he may have the right to purefaase, at 
12.50 per volume, one copy of each volume of the Official Journal published during the 
period of his service membership.*- 



ABSTRACTS OF PAPERS 

Presented on August 13,1944, at the Wellesley meeting of the Iiutitute. 

1. Statistical Tests Based on Permutations of the Obsenrations. A. Wald and 
J. WoLFowiTZ, Columbia University. 

It was pointed out by Fisher that statistical tests of exact size, based on permutations 
of the observations, can be carried out without assuming anything about the underlying 
distributions except their continuity. Scheff6 has proved that, for an important class of 
hypotheses, t hese tests are the only ones with regions of exact size. Texts based on permuta¬ 
tions of the observations have been constructed by Fisher, Pitman, Welch, and the present 
authors. In the present paper, the authors prove a theorem on the limiting normality of 
the distribution, in the universe of permutations, of a class of linear forms. Application 
of this theorem gives the limiting normality (in the universe of permutations, of course) 
of the correlation coefficient, and of a statistic introduced by Pitman to test the difference 
between two means. The limiting distribution of the analysis of variance statistic in the 
universe of permutations is also obtained. 

2. Error Control in Matrix Calculation. Franklin E. SArrERTHWAiTE, Aetna 
Life Insurance Co. 

The arithmetic evaluation of matrix expressions is often rather complicated. One of the 
catises of this is the fact that relatively minor errors (such as rounding errors) introduced 
in an early step may be magnified to such an extent in succeeding steps that the final result 
is useless. Iterative methods to meet this difficulty have been reviewed very completely 
by Hotelling. In this paper a different approach is taken. Conditions on the norm of a 
matrix are determined so that a Doolittle process will not magnify errors to more than two 
or three decimal places. It is then pointed out that if an approximation to the inverse of 
the matrix is available, most problems can be rearranged so that the required norm conditions 
are met. A Doolittle process may then be used to any number of decimal places with as¬ 
surance that errors will not accumulate to more than a limited number of decimal places. 

3. On Cumulative Sums of Random Variables. A. Wald, Cvolumbia University. 

Let |z<| (t « 1,2, • • • ad inf.) be a sequence of indep>cndent random variables each having 
the same distribution. Denote by Zj the sum of the first j elements of the sequence. Let 
a > 0 and 6 < 0 be two constants and denote by n the smallest integer for which either 
Zn'^CLOv Zn <b. Neglecting the quantity by which Zn may differ from o or 6 (this can be 
done if the mean value of \zi\ is small), the probability that Zn > c iot c ^ a and c ^ b 
is derived, and the characteristic function of n is obtained. The probability distribution of 
n when Zi is normally distributed is derived. These results have application to various 
statistical problems and to problems in molecular physics dealing with the random walk of 
particles in the presence of absorbing barriers. 

4. The Approximate Distribution of the Mean and of the Variance of Indepen¬ 
dent Variates. P. L. Hsu, National University of Peking. 

Let Xk be mutually independent random variables with the same cumulative distribution 
function; let E{Xk) — 0, E(Xl) »» 1 and E(Xk) «. Finally put S • n“i X* and 

tr “ n“i £*-1 "^he author first gives a new derivation of H. Cramer’s well- 

known asymptotic expansions for Pr (n*S < x). The proof is much more elementary and 
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avoids in particular the use of M, Riess’ singular integrals. Instead a oonsiderahly 4in^ler 
Cesaro-type kernel is used, which has first been introduced by A. C. Berry (TVons, Aater. 
Math, Soc. 49 ( 1941 ), pp. 122 - 136 ). The same method is then used to derive similar asymp¬ 
totic expansions for Pr (n* (17 - 1) < (3-1 )*x) . The method can be extended to the case 
of unequal components and also for the study of other functions encountered in mathematical 
statistics. 

5. Ranges and Midranges. E. J. Gumbel, New School for Social Research. 

The mth range Wm and the mth midrange Vm are defined as the difference and as the 
sum of the mth extreme value taken in descending magnitude (‘Trom above**) and the 
mth extreme value taken in ascending magnitude (“from below**). The semi-invariant 
generating functions Lm(0 and mL(t) of the mth extreme values from above and below are 
simple generalisations of the semi-invariant generating functions of the largest and of the 
smallest value which have been given by E. A. Fisher and L. H. C. Tippett. If the sample 
size is large enough the two mth extreme values may be considered as independent variates. 
Then, the semi invariant generating functions L^(t, m) and /),((, m) of the mth range and 
of the mth midrange are 

m) » Lm{t) -h «,L(—<); !/„(<, m) ■■ Lm{i) -h mLif), 

If the initial distribution is symmetrical the semi invariant generating function of the mth 
range is twice the semi invariant generating function of the mth extreme value from above. 
The distribution of the mth range is skew, whereas the distribution of the mth midrange is 
of the generalized, symmetrical, logistic type. The even semi invariants of the mth midrange 
are equal to the even semi invariants of the mth range. For increasing indices m the distri¬ 
butions of the mth extremes, of the mth ranges and of the mth midranges converge toward 
normality. 

6. Statistics of Sensitivity Data, II. Preliminary report. C. W. Churchijan, 
Frankford Arsenal, and Benjamin Epstein, Westinghouse Electric and Manu¬ 
facturing Co. 

In this paper a study is made of the distribution of the first two moments of sensitivity 
data as functions of the sample size. The chief results are briefly these: 

(a) The distributions of t and <r| (for definition of these functions, see '* 0 n the Statistics 
of Sensitivity Data,** by the authors in the Annals of Mathematical Statistics, Vol. 
XV, No. 1) approach normality rapidly as functions of the sample size; 

(b) i and al are '^almost** independent even for small sample sizes, thus justifying the 
use of Student *s ratio in texts of significance for differences between two sample 


means. 




SOME EXTENSIONS OF THE WISHART DlSTTaBUTION' 

T. W. Anderson and M. A. Girshick 

Princeton University and Columbia University 

1. Introduction. The well-known Wishart distribution is the distribution 
of the variances and covariances of a sample drawn from a multivariate normal 
population assuming that the expected value of each variate remains the same 
from observation to observation. For problems such as testing collinearity [1], 
comparing scales of measurement [2], and multiple regression in times series 
analysis [3], it is desirable to have the distribution of sample variances and co- 
variances for observations, the expected values of which are not all identical. 
Such a distribution could be considered as a generalization to several variates 
of the x'* (non-central x*) distribution, as well as a generalization of the Wishart 
distribution to the non-central case. In this paper we shall discuss the general 
problem of finding the distribution in question and shall derive this distribution 
for two particular cases. We shall start out with the problem in its most general 
form and as a result of linear transformations express the distribution as a certain 
multiple integral. 

We can think of the expected values of the observations as defining points in a 
space of dimension equal to the number of variates. If these points lie on a line, 
the non-central Wishart distribution is essentially the Wishart distribution 
multiplied by a Bessel function; if the points lie in a plane, it is a Wishart distri¬ 
bution multiplied by an infinite series of Bessel functions. For higher dimen¬ 
sionality the integration of the multiple integral becomes extremely troublesome; 
it has not been possible yet to express the general integration in a concise form. 
These results are summarized precisely at the end of the paper. 

2. Reduction to canonical form. Consider a set of N multivariate normal 
populations each of p variates. Let the ith (t = 1, 2, • • • , p) variate of the ath 
(a = 1,2, * • • fN) population be Xia ; let the mean of this variate be 

(1) E{xia) = Mi« (f = 1, 2, »• • , p; a = 1, 2, • • • , AT); 

and let the variance-covariance matrix (of rank p) common to all N distributions 
be 

II E[{Xia - flia)iXja - M/«)] || = || || (of = 1, 2, • - , N). 

Now consider a sample of observations {a;,*} one from each population. 

The purpose of this paper is to find the joint distribution of the quantities 

N 

( 2 ) aij = (Xia Xj)f 

a-1 

1 The results given below were arrived at independently by the two authors. Some 
preliminary results were given before the Institute of Mathematical Statistics at Wash¬ 
ington, D. C., May 6,1944, by Girshick. 
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where 




1 


To simplify the notation in the subsequent work we treat the quantities a</ 
instead of the sample variances and covariances which are simply multiples 
(by 1 /(N — D) of the a*,-. 

The tti,* may be considered as sums of squares and cross products, for there 
exists a linear transformation^. 

N 

Xia, = 53 (i * 1» 2, • • • , p)| 

where the matrix || || is orthogonal (and Bm — But - • • • — Bifs - l/y/N)^ 

such that 


n 

(^ij ^ XiaXfa 9 
a-L 

where n ~ AT — 1 and 

== XiifX^jif . 

For a given a the Xia have a multivariate normal distribution with the same 
variances and covariances as the x^& and with expected values 

iV 

E(xi„) = 2 , say. 

(1-1 

Let 

n 

r<y - 53 • 


Then it is clear that the rij are the same functions of the that the are of 
the a;^s, namely, 


(3) 

where 


T.-y = 53 (M»a — yi){yia ~ My), 

a-il 



iV «-i 


Now consider the two p by p matrices 


S = ||<r«,| 

and 

T = ||r<,| 


* See, for example, [4].. 
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Ki , K* I " ■ , Kp be the teal, non-negative roots ci the determinantal eqiution 
(4) 1 T - XS 1 - 0. 

There exists a non-singular p by p matrix 


(6) 

such that' 

^ = 



(6) 

f'S'it'' = I 


and 


Ki 0 • • • 

0 



0 kI • • • 

0 

(7) 

'i'T’J'' = 




0 0 • • ■ K p 


where I is the identity matrix and 4^' is the transpose of 4'. Suppose the rank 
of T is t; then t of the roots are non-zero and p — t are zero. For the sake of 
convenience we shall choose k* , k* ,*••,«« to be the non-zero roots. If T is of 
rank t, then the means ju<a lie in a < dimensional sub-space of the original p 
dimensional space. Let us make the transformation 

P 

( 8 ) = E • 

7-1 

The Zia are normally and, because of relationship (6), independently distributed 
with unit variances. The mean value of Zia is 


/-I 


say. As a result of (7) 

(9) 2 (^ = Ij 2, • • •, p), 


( 10 ) 


n 


E 


Via Via = 


0 


(i 5^ J). 


Let the new sum of squares of cross-products be 

n 

( 11 ) bij = ^ Zia Z^a • 

a->l 

We shall first find the joint distribution of the bn and then obtain the distribution 
of the an by using the fact that the bn can be considered simply as a linear 


* See, for example, [5]. 
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transformation of the a</ in a + 1) dimensional space. For we can write 
5</as 

(12) 4^iKy^ikXhm^ka =* ]C ^ih^ik<ihk • 

a-il Khml ^fe-1 

The transformation (8) is performed on all variates of each observation. The 
next transformation, which is the one that results in the canonical form of the 
problem, is performed on all observations of each variate. We wish to construct 
the n by n matrix of this transformation 

^ 

in the following manner: Let 

“ ““ (® ~ 1>2, *•* y fj 1, 2, • * • ft')* 

In view of (9) and (10) 

ft 

^ , (f> 1? “ If 2, • • • , 0 

a^l 

where is the Kronecker delta. The remaining elements in ^ are chosen in 
any way to make ^ orthogonal. 

Now make the transformation 

t/ia = 2 (^ — 1, 2, • • • , p; ot =» 1, 2, • • • , n). 

^-1 

Because $ is orthogonal, 

n 

(13) &«= 2 

a—1 

and the y^B are independently normally distributed. By virtue of the construc¬ 
tion of and the properties of 11 11 the expected value of each yia is zero except 

for t of the variates, namely, 

■®(v*w) ” (^ ~ 1> 2, • • • , ^) ^ 

Now the problem can be put in this form: Find the distribution of 6,/ (given by 
(13)) when the distribution of the y^B (in the canonical form) is 

1 S S 

A *-1 «-l 

(27r)*^/2 

where ki , k 2 , • • • , are different from zero. 

The solution of our problem can be expressed as a certain multiple integral of 
iy variables. Let 

n 

— iC yutVi*' 

a-t+t 



m 


SXXBNSIONS or WISEAST OZaSRBIBtmOK 

Smoe the btt have the Wishait distribution witb n — t degnet ci freedom (yn 
assume n ^ t + p) we can write the joint distribution of the b'n and (t » 
1,2, • • • , p; ij * 1,2, • • • , 0 as 

1 -i h <>'n 

^ + 1 _ i]) 


Considering the equations 


X 


1 S S 

_±_ a 


I 

bif = bii + ]C VinVin 

9-1 


as a transformation of the bii we immediately obtain the joint distribution of the 
6 ’s and the = 1 , 2 , • * • , p; 17 = 1 , 2 , • • • , 0 as 


, n-i 

(14) ]ft r(i[n 

P « 

2 ) 2 

Xe '■* 

To find the distribution of the bn we must integrate out the , where the 
range of integration is such that the matrix 

II bn - E yinVf, II 

9»1 

is positive. For f = 1 or 2 we can integrate (14) and express the results in a 
convenient form. However, for higher values of t the integration affords con¬ 
siderable difficulty and has not been done for the general case. In terms of 
geometry the case f = 1 is the case in which the expected values of the observar 
tions lie on a line in the p dimensional space. In the case of i = 2, similarly the 
expected values lie in a plane in this space. Hence, we shall call these two cases 
the linear and planar cases, respectively. 

3. The Hwear case. In the linear case there is one root of the equation (4) 
which is not equal to sero, that is, there is simply one k in the distribution (14) 
and one set of p's, namely p«(i = 1, 2, • • • , p). The problem is to integrate the 
ya over the range for which the matrix 

II bn — Vayn || 


- «-t- 1 - *]) 
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is positive; the integrand we are interested in is (dropping the subsmpt “1” 
from the ki and 2^.1 and neglecting the part not involving the y’s) 

(15) 

«>wl 

The determinant in expression (15) can be expressed as 


5« - J/y i = 1I (l “ Vjj . 


where 


= \\bu\\-\ 


The inverse exists because the probability is zero that || 6t, || is singular. There 
is a linear transformation 


such that 


Vi =T, 9.7%: 
y-1 


i—I 


Kyi = lUi , 

where t is the one non-zero root of the equation: 

(16) I - K^En I = 0, 

where = \\b'^ \ \ and En is the matrix with unity in the upper left hand comer 
and zeros elsewhere. This fact is a result of the well-known theorem concerning 
diagonalization of pairs of quadratic forms/ The Jacobian of this transformation 
is 

I 9 <f I = I 5 i/ 1 *, 

V 

and the range of integration is 2 < 1. The integrand is transformed into 


I i(n-p-l) 




P \i(»—P--2) p 


Now let 


Ml = Sin w, 

Mj = cos Ml Vi -1 


(t = 2, 3, • • • , p) 


* The transformation is the so-called ^^regression transformation.” See Madow [6]. 
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3S1 


The Jacobian of this transformation is cos’ v> and 

1 — ^ tt* = cos* w ^1 — ^ • 

The integration is over the ranges oi'^- < w <~ and 

t-1 

We integrate the following expression 

K p~l -I 

1 - Z t'H n {cos*^ toc'•*“” dto}. 


The integral of the quantity within the brackets is simply a Dirichlet integral [7] 
and its value is 


r(§[n - 
r(Mn - 1]) 


The integral of the expression within the braces is a multiple of a Bessel function 
of purely imaginary argument [8, p. 79]; that is, 


r(i[n - l])\/tr 




Hence, the integral of (15) is 

(17) I bii |*'"-'-‘>r(J[»t - p])x‘’(l/2)-*'"-*>7i(n-*,(i). 

Multiplying equation (16) by the determinant |B| one can easily show that the 
non-zero root, Z*, is simply k^h , The distribution of the b*/ in the linear case 
then is 


——i X ^ii 

e e 





5.71 “’‘"’^”(**5ii)““"’^>/4(»-»(*V5ri), 


In §6 we shall give the distribution in terms of the original variables, namely, 
the CLij • 


4. The planar case. The case of two non-zero roots of equation (4) can be 
handled by continuing the process of integration of §3 another step. The 
essential problem is the integration of 

(18) 1- Z VinVi. 1 fr n dy,, 

ir»i 1 
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over the range of j/’a for which the matrix 

II- T,Vi%Vh\\ 

is positive. The integration is done in t^vo stages, first with respect to the yn , 
then with respect to the ym . Letting 

hij == hij — y%^2i , 
n as n — 1, 

K = fCi , 

= yn , 

and omitting for the time being 

(19) c*’"’* jft , 

we can write the first stage of the integration of (18) as 

f |6«- §< Si I 

over the range || 6o* ^ y<5i II positive. But, the only difference between this and 
the integration of (15) which has been shown to be (17), is that we are now writing 
all variables with signs. Keeping this in mind, changing back again to 
our other variables of Section 4 and inserting again (19) we can write the first 
stage of the integration of (18) as 

r(i[n - P - l])7r*'’|6« - I ~ ^*^^ 1 

(20) ■-_, 

• I i(n- 8 ) (\/Ki bn — Ki 2/12) IT . 

Now we must integrate (20) with respect to the yi 2 over the range || ha ~ 2 /< 22 /j 2 || 
positive. The determinant in (20) can be written as 

|5.vl(l - VitVi^ • 

There is a transformation 

p 


(21) 

yt2 — 2Z 9ii^i > 

i-i 

such that 

h*’ynyfi = «*, 

KlVXt — t 

(22) 


«2 yn — di + di S2 ) 
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where/* is tile one non-iaro root of tin equation 

I - k!Eu I - 0, 

where and Eu are used as in equation (16). Since this equation 
is similar to (14), then /* = xtbu. The values of di and dt will be considered 
later. This result is deduced from an extension of the theorem oonoeming tiie 
diagonalisation of pairs of quadratic forms.* The Jacobian is 

I ffo I “ I bii I*, 

and the range of integration is «f < 1. The integrand (20) is now changed to 

( P 

1 - g e****^*'* 

Next the following transformation is made: 


«! = sin t£?i, 

8% = cos t&i sin , 

Si = cos Wi cos (t = 3, 4, • • • , p)* 


The Jacobian is cos*" Wi cos*^^t£?j, and 

1 — cos* cos* W 2 . 

We now integrate 

1 - g j n d»ij 

(23) -jcos"^ Wi cos"-* 0 cos in)”*'"”*’ 

• Jl(i.-«(/008 Wx) dwt diCsj. 


The integral of the expression within the square brackets is another Diriohlet 
integral; its value is 


r(i[» - 
r(Mn - 2]) • 


* Again the '^regreesion transformation” is used. See footnote for Section 3. 
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Similar to Section 3 the integration of the quantity within the braces with respect 
to is 

r(J[n - 2])y/^ {di cos lOi) 

/ t \ —J(n—8) 

•C08“~* ICiC**** “"‘fl COS M»i) lUn-nif COS Wi)- 

Since the range of integration of Wi is — t/2 < < t/2 and since sin Wi is an 

odd function and cos wi is an even function, the integral of the above expression 
can be transformed into an integration over the range 0 < < t/2 by replacing 

^di Bin {jy 2 sinh (di sin wi). In view of the relationship between the Bessel 
functions of purely imaginary argument and sinh {di sin wi) [8, p. 54] we can write 
the integral of the above expression as 

( j \-i(n-8) _ _ 

I j Vdi V2ir 

prl2 

X / sin Wi)I^(n-^y{d 2 cos iOi)J|(n-8)(/ COS Wi) sin i Wi cos Wi dwi . 


This integral can be expressed in another form by virtue of a formula in Watson's 
Bessel Functions [8, p. 377] as 

^24^ Vi r + dl +f - 2dif cos u ) 

^ ’ •'« (dl + dl+f - Mij cos 

Letting + f = x and (kf = y and using an expresion formula for Bessel 

functions [8, p. 140] we can write (24) as 

Vi 2«"-« r t (Vi) sin"-* « du. 

Jq 7-0 7 i 

Since the integral of cos'*' u sin"“®w where y is odd is zero, the result of the integra¬ 
tion (using the ‘'duplication formula" for T functions and letting y = 2cu) is 

* —i(n—2)+2« 

(y) X 


r(i[n - 2 ])t2' 




‘^l(n-2)+2« (Vi). 


From the relationship (22) it is clear that the equation 

(26) I 1 = 0 (*t5 = 0 for i = 3, 4, 

is transformed by (21) into 


, P) 


(26) 


+ dl — X didi 


didt 

0 


0 

<^2 — ^ 

0 -X 


0 

0 

0 


= 0 . 
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RKpression (25) is equivalent to 

(27) I kjefitf — XJ«/1=0. 

Hmoe, X and y*, which are the sum and product, respectively, of tiie non-aero 
roots of the two equivalent equations (26) and (27), are given by 

+ <5 + d| = Kiba + «c|5a, 

y* = /*<^ = «!*i( 5 ui»« - bit). 

In view of this result we can now write the integration of (23) as 

r(Hn - p - il)r(M» - p])T’2*<"-«|h«|*‘"‘'~" x £ 

•(«!6ii + i<lb«)-“-*'*^’l,c_«+a.(\/«!5u + 46a)- 

Finally, by multiplying in what was left out of (18) we obtain the integral of (14) 
which is the solution to the problem as stated in the canonical form: 


(28) 


Jj r(|[n - 1 - tl) 


<-1 
2 2 / 


“-Va,!r(^l + u,) 


X ■rj(»-2)+le*(V^Xi 6u 4" Klbist), 


6« Final form. To answer the problem as stated originally it is necessary to 
make the transformation (12) and obtain the distribution in terms of the an for 
the linear and planar cases. 

It is clear that equation (25) is equivalent to 

(29) I T - X.4“' I = 0, 

where A"^ = || a.y ||~\ for T is the transform (by (5)) of || K^dn [| and A“^ is the 
transform of H bi? 1|. The sum and product of the non-zero roots, which are the 
arguments of the infinite series in (24), remain unchanged. 

Since the quantity ^ Ki is the sum of the roots of (4) it can be expressed as 

^ iC? = 23 - My), 

or-il 

where || <^‘^ |1 = || o’<j IP^ Furthermore we have 
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and 


£ hi -> tr (I bn II “ tr (’i'll an ||'»0 ® i 


iml 




Moreover, the Jacobian of the transformation (12) is* 
(30) |/| = l'*' 


I P+1 


|-J(P+1) 


Henc^, we have the following results: 

Given N multivariate normal populatuma each of p variates with identical variance- 
covariance matrices || || and with expected values of the pN variates Xia given 

by (1). Let a^ be defined by equation (2); let the rank of the matrix || ra || defined 
by (3) be t. 

(i) When t = 0, the joint distribviion of the aij is given by the Wishart distribution. 

(ii) When t = 1, the joint distribution of the an is 

p If 


e M-i-i_I |*<* ” I an T” 


P~1 


2tP(v-i)-j(i»r-.)^lp<,-i) - 1 - i]) 

X .'i-'""' f t t »«0.» fc)] 

a-1 J 

X /|<Ar-») 2^ 23 «.i(M<a — Jii){l>'}u — /i>)\ 

\ r a,/-I a.l / 

(iii) When t = 2, the joint distribution of the an is given by 

r* lyy!«.-»I pc.-.-*) 

g _ 2 - tl) 


"I—i(JV—•) 


(32) 


-1 S »•■'»</ ^ 

X e X E 


(%«*)“ 


2*"«ir + «) 


(«i + 


X /i(iV-«)+2«(\/wi + ^ 2)1 


where ui and Ui are the two non-zero roots of (29). 

(iv) When t > 2, the joint distribution of the o<y can be written by means of 
expression (14) as a multiple integral. The explicit form of the distribution has not 
yet been obtained. 


* One method of demonstrating this fact is to apply (8) to centrally distributed variates 
and compare the Wishart distribution of the transformed variates with the Wishart dis¬ 
tribution of the original variates. 
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STATISTICAL TESTS BASED ON PERMUTATIONS OF THE 
OBSERVATIONS 

A. Wald and J. Wolfowitz 
Columbia University 

1. Introduction. One of the problems of statistical inference is to devise exact 
tests of significance when the form of the underlying probability distribution is 
unknown. The idea of a general method of dealing with this problem originated 
with R. A. Fisher [13,14]. The essential feature of this method is that a certain 
set of permutations of the observations is considered, having the property that 
each permutation is equally likely under the hypothesis to be tested. Thus, an 
exact test on the level of significance a can be constructed by choosing a propor¬ 
tion a of the permutations as critical region. In an interesting paper H. Scheff4 
[2] has shown that for a general class of problems this is the only possible method 
of constructing exact tests of significance. 

Tests based on permutations of the observations have been proposed and 
studied by R. A. Fisher, E. J. G. Pitman, B. L. Welch, the present authors, and 
others. Pitman and Welch derived the first few moments of the statistics used 
in their test procedures. However, it is desirable to derive at least the limiting 
distributions of these statistics and make it practicable to carry out tests of 
significance when the sample is large. Such a large sample distribution was 
derived for a statistic considered elsewhere [1] by the present authors. 

In this paper a general theorem on the limiting distribution of linear forms in 
the universe of permutations of the observations is derived. As an application 
of this theorem, the limiting distributions of the rank correlation coefficient and 
that of several statistics considered by Pitman and Welch, are obtained. In the 
last section the limiting distribution of Hotelling's generalized T in the universe 
of permutations of the observations is derived. 


2. A theorem on linear forms. Let = (/ii, /12 ... , M 2, ... , ad 

inf.) be sequences of real numbers and let 

UriHy) = AT* i: ((ft. - Z ht)' 

a-1 \ / 


for all integral values of r. We define the following symbols in the customary 
manner: For any function f(N) and any positive function <p(N) let f(N) = 
0{ip(N)) mean that \f(N)/<p{N) \ is bounded from above for all N and let 


mean that 


/(AT) = fi(^(Ar)) 
/(AT) = Oiipm 
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and that 
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Also let 
mean that 


f{N )« 

lim f(N)MN) = 0. 


Let [p] denote the largest integer < p. 

We shall say that the sequences Hjf(N = 1,2, • • • , ad inf.) satisfy the condition 
W if, for all integral r > 2, 


( 2 . 1 ) 

For any value of N let 


tlr(Hs) 


= 0 ( 1 ). 


X = (xi, X2, •.. , a:jv) 


be a chance variable whose domain of definition is made up of the Nl permutations 
of the elements of the sequence Aif = {at, (h, • • •, fljv). (If two of the a»(i = 
1,2, • • • , i\r) are identical we assume that some distinguishing index is attached 
to each so that they can then be regarded as distinct and so that there still are N\ 
permutations of the elements ci, • • • , a^). Let each permutation of An have 
the same probability Let E{Y) and a^{Y) denote, respectively, the 

expectation and variance of any chance variable F. 

We now prove the following: 

Theorem. Let the seqicences An = (ni , ^2 , • • •, a^r) and Z)jv = (di , ^2, • • • , d^r) 
{N == 1, 2, • • • , ad inf.) satisfy the condition W, Let the chance variable Ln 
be defined as 

iV 

Ln ~ di Xi . 

«..i 


Then as N the probability of the inequality 

Ln E{Ln) K t a (Ln) 


for any real t, approaches 



For convenience the proof will be divided into several lemmas. 
Since 


♦ __ Ln E(Ln) 


remains invariant if a constant is added to all the elements of Dn or of An, or 
if the elements of either of the latter are multiplied by any constant other than 
zero, we may, in the formation of LjJ, replace An and Dn by the sequences As 



360 


A. WALb AND J. WOLIOWm 


and D'h, respectively, whosetth elements a'i and di(i * 1,2, •• • ,iV) are, respec¬ 
tively 

(2.2) a'i = [w(4«)l"* (oi - iV* g 

and 

(2.3) d’i - lMl(/)i»)]"* (di - JV“‘ g dy) . 

The sequences A'k and D'k satisfy the condition W. Furthermore, 

(2.4) iUx(A*) - w(Z)y) » 0 
and 


(2.6) 

= fniDfi) * 1. 


Lemma 1. 



(2.6) 

Z •• • Z • • • ol* - 

«l<«*<***<«*^^ 

0(1^“'*') 

(2.7) 

Z - Z dl.dl. • • • dl* = 



From (2.4), (2.6), and the fact that the A', and Dy satisfy condition W, it follows 
that the A'k and D'h satisfy conditions a), h), and c) of the theorem on page 383 
of [1]. Our lemma 1 is the same as lemma 1 of [1]. 

Lemma 2. Let 


V = (t»i, t),, ... , t»w) 

be the same permutation of the elements of Alf that X is of the dements of An. Let 
y = vi-- VkZ where z = wU+u • • • v\i+r),ij> 1 (; = 1,2, ■ • • ,r),andk,r,ii, • • •, 
i, are fixed values independent of N. 

Then 

(2.8) E{y) = 0(iV‘*"'-*). 


This is Lemma 2 of [1]. 

In a similar manner we obtain that 


(2.9) 


23 ■ ■ • £ do, • • • do»do(*»+,) 

«l>as.«**,a(k+r) 


= 0(iV“'*'''*)-0(iV*'^) 




The summation in the above formula is to be taken over all possible sets of i + r 
distinct positive integers < N, 

Lemma 3. Let ai, • • •, a(ife+r) he {k + r) distinct positive integers < N. Then 

(2.10) EiviVt • • • t;*t»(4+i) • • • ujl+r)) = E(Va^Vaj • • • • • • »«%+,,). 

This follows from the fact that all permutations of have the same probability. 
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Lbuma 4. La 





Then 

(2.11) E{L'/) - 0(i\r) 


Pboof: Expand Zi«’ and take the ejected value of the individual terms. The 
contribution to E{L/) of all the terms which are miiltiples of the type appearing 
in the right member of (2.10) with fixed k,r,ii, • • •, i,(k + »i + • • ■ + V “■ p), 
is, by Lemmas 2 and 3 


o(jv'*«>-*).2: ..-Zd;, 


all different 










Since ij > l(j =“ 1, • • * , r), it follows from the fact that k + ii + ••• + ir 
that 2r < p — A and that 2r * p — fc only if fi =* • • • =* ir =® 2. Now 


(2.12) 2[‘]-H-rSrS^-Js|. 


Hence the maximum value of 2 



A; + r is reached when r =* 


This proves the lemma. 

From the last remarks of the preceding paragraph we obtain 
Lemma 5. 



and k 


-P 


- 0 . 


(2.13) 


^ (Z • • • Z dZ • • • dZ)E{v\ .. • v)) - om. 


_ •*.«/ 
all different 


We now prove 
Lemma 6. 

(2.14) EiL'f,) = 0 

(2.16) E(Z,;’) = NEivl) + o(N) = AT + oiN). 

Equation (2.14) follows from (2.2). Consider the expectations of tibe various 
terms in the expansion of Ln. The sum of all the terms of the type 

did^Eiv^f) 

is 

(Ld'idi)Eivtv^ = OimOiN-^) - 0 ( 1 ). 

by Lemmas 1 and 2. The sum of all the terms of the type 

d?E(pl) 
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is 

(E E(v\) »= NE(vl) = N, 

by (2.2) and (2.3). This proves the lemma. 

Lemma 7. 

(2.16) E{vl • • • = 1 + 0 (1) 

(2.17) L • • • Z dl* • • • d«, = + om. 

all different 

From (2.2) and (2.3), and Ijemma 3, it follows that it will be sufficient to 
prove (2.17), because (2.16) follows in the same manner. Consider the relation 

“ 2 • “ 2 ^ 1^1 * * * + other terms. 

\i-l / 

all different 

By (2.9) the sum of these ether terms must be not larger than From 

this follows the lemma. 

Proof of the theorem: Since 

j* ^ Liv _ Ljf — E{Lfr) 

it will be sufficient to show that the moments of L* approach those of the normal 
distribution as iV —> oo. From (2.14), (2.15), and (2.11) we see that, when p is 
odd, the pth moment of L* is 0{N~^) and hence approaches zero as AT —> oo. 
When p is even and = 2s (say), it follows from Lemma 5 that 



This completes the proof of the theorem. 

It will be noticed that nothing in the foregoing proof requires that, when 
N < N\ the sequences Ajf and Dy be subsequences of and . Indeed, 
the sequences were written as they were simply for typographic brevity. We 
have therefore 
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Corollary 1, The theorem ie valid for eequmcee 

Ah =* {flHi > • • • f O^hh) 

Dh ** {dni , • • • I dsN) 

(N = 1, 2y • • • ad inf.) 
provided iheyfvlfiU condiHon TT. 

Corollary 2. // elements ay(t = 1, 2, • - ad inf.) are all independent 
observations on the same chance varidbUj all of whose moments are finite and whose 
variance is positive, the sequences Ah(N = 1, 2, • • • , ad inf.) wiUfulfiU condition W 
with probability one. 

3. The rank correlation coefficient For this well known statistic (see [3]) 

Ah * Dh ^ (1, 2, 3, • • • , N). 

The sequences Ah and Dh satisfy the condition W. For 

and hence, for r > 3 

firiAn) = ^riDn) = 0{ir). 

Also 

H2(,Ah) == tl2{DH) = 

Hence the distribution of the rank correlation coefficient is asymptotically 
normal in the case of statistical independence. This result was first proved by 
Hotelling and Pabst [3]. 

4. Pitman’s test for dependence between two variates. The distribution 
of the correlation coefficient in the population of permutations of the observations 
was used by Pitman [4] in a test for dependence between two variates which 
“involves no assumptions” about the distributions of these variates. In our 
notation, let 

(a,', di){i = 1, 2, • • •, AT) 

be N observations on the pair of variates A and D whose dependence it is desired 
to test. Then the value of the correlation coefficient is 

AT' Z diOi . 

At the level P the observations are considered to be significant if the probability 
that N~^ \ Lh\ he equal to or greater than the absolute value of the actually 
observed correlation coefficient is < p. 
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In his paper ([4], page 227] Pitman points out that if the ratios of certain 
sample cumulants are “not too large/' then, as iV —► <», the first four moments 
of will approach 0, 1, 0, and 3, respectively (the first moment is always 

zero). Our theorem and the relation (2.16) make clear that under proper 
circumstances all the moments will approach those of the normal distribution. 

5. Pitman's procedure for testing the hypothesis that two samples are from 
tilie same population. For testing the hypothesis that two samples came from 
the same population Pitman [5] proposed the following procedure: 

Let one sample be 

and the other 

Um+l > » • • * > fltm+n • 

Write m + n = iV, and construct the sequences As and As as before defined* 
Let 

d,- = 1 (t = 1, • • • , m) 

d* = 0 (t = m + 1, • • • , iV) 

and construct the sequences Ds and Ds . Then the value of the statistic con¬ 
sidered by Pitman is, except for a constant factor, 

(6.1) 

At the level the observations are considered significant if the probability that 
iV~* I Lir I be equal to or greater than the observed absolute value of the expres¬ 
sion (5.1) is < /3. 

Let iV 00 , while — is constant. Then the sequences Ds are seen to satisfy 
n 

condition W. If then the sequences As satisfy condition W we may, for large AT, 
employ the result of our theorem and expeditiously determine the critical value 
of Pitman's statistic. 

6. Analysis of variance in randomized blocks. Welch [7] and Pitman [6] 
consider the following problem; Each of n different “varieties of a plant” is 
planted in one of the n cells which constitute a “block.” It is desired to test, 
on the basis of results from m blocks, the null hypothesis that there is no difference 
among the varieties. In order to eliminate a possible bias caused by variations 
in fertility among the cells of a block, the varieties are assigned at random to the 
cells of a block. If the cells of the jth block are designated by (yi), (j2), • • •, 
(jn)y a permutation of the integers 1, 2, • • •, n is allocated to the jth block by a 
chance process, each permutation having the same probability (nl)‘“\ 

Let Xijk be the yield of the tth variety in the fcth cell of the jth block to which 
it was assigned by the randomization process. It is assumed that 

Xijh = Vik + , 
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where i/yn, is the of the ftth cell in thejth block, 6i is the *‘eflfecV' <rf th^ fth 

variety, and €/ib are chance variables about whose distribution we assume noth^^ 
The null hypothesis states that 

kb » - • • » » 0. 


Let ayjb be the yield in the A:th cell of the ith block and xa the yield of the ith 
variety in the jth block. If the null h 3 npothe 8 is is true then, because of the 
randomization within each block described above, the conditional probability 
that, given the set {ayfc}(A; « 1, 2, • • •, n), the sequence a;i/, xjy • • •, Xn,*, be any 
given permutation of the elements of {a/k} is (nl)~\ Permuting in all 
blocks simultaneously we have that, under the null hypothesis, given the set of 
mn values {ay*,} (j == 1, 2, • • • m; A; = 1, 2, * • *, n), the conditional probability 
of any of the permutations is the same, (n!)”***. This permits an exact test of 
the null hypothesis. 

The cisussical analysis of variance statistic that would be employed in the 
conventional two-way classification with independent normally distributed 
observations is 

F = («t - l)w 13 (»<■ - 3!)* 

12 H iXii — Xi. - X.j + *)* 


where 


Xy. = Wl ^ 2 Xij 

i 



X = (mny^ 22 2 • 


The statistic W used by Welch and Pitman is 

W ^ F{m-1 + Fr\ 


Since W is a monotonic function of F and the critical regions are the upper tails, 
the two tests are equivalent. The distribution of F or W is to be determined in 
the same manner as that of the other statistics discussed in this paper, i.e., over 
the equally probable permutations of the values actually observed. The critical 
region is, as usual, the upper tail. 

Since x<y takes any of the values ayi, • • •, ayn with probability 1/n, we have 

(6.1) Eixij) = 23 Oik = Oj (eay). 

* 

(6.2) a\xij) = n~‘ 23 (oyife - %)* * h (say)- 

k 

<r(i<,yx<,,) = [n(n - 1)]“‘ 12 - o* 

•= [n(n - 1)]”‘[(2 oa)* - 2 «**] - o* 


[n*o? - ^ a**] [n(» - 1)1 ‘ - o* 

(n - l)"‘[o* - n~^ ^ ®**1 “ -(» - 


(6.3) 
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Hmce 


E{X{.) = m“* 2 • 

ff*(xi.) - m"* 2 6/ -« 6 (say). 
cr(x<,.»*,.) = -[m*(n - 1)]“* S 6^ * c (say). 


ti t« 




(t, V - 1, • • • , n) 


where || X<» || is an orthogonal matrix and 


X«i — Xfij — • • • — X»i, — w 


Then it follows that 


£(*?.) = 0 
<r*(x*) = 6 — c 
ff(x?,.x?,.) = 0 


(t = 1, 2, • • • , « — 1) 

(ii ^ i*; ti, tj = 1, • • • , n — 1). 


Furthermore, we have 


X *** = X (*.•• - *)*• 


^-1 t-l 


Applying the well known identity 

XZ{Xij — Xi, — a;.y + xf = SS(a:,y — x.,)^ — m2(x,. — a:)* 
to the definitions of F and W we obtain 


. W = 


^ S — x)* 

X X (»« - »i)* 


The denominator of the right member of (6.9) is invariant under permutations 
within each block and equals 


Hence 


( 6 . 10 ) 


X X (flik - oy)’ = (n - l)m*(fe - c). 

i k 


W - [min - l)(b - c)]"* X (iV- t 
* [m(n — 1)(6 — c)]"“^ 2 


If the joint distribution of the a;?, (f = 1,2, • • • , n — 1) over the set of admissible 
permutations approaches a normal distribution with non-singular correlation 
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matrix as m, the number of blocks, becomes large, it foUoMns from (6.7) and (6.10) 
that the distribution of m (n — 1) TF approaches the distribution witih n — 1 
degrees of freedopi. Hence it remains to indicate conditions on th^ set (ay4 
which would make the distribution of the x* approa<^ normality. Each x?. is 
the mean of independent variables, so these conditions need not be very 
restrictive. 

According to Cram6r [8], Theorem 21a, page 113, if the variances and co- 
variances fulfill certain requirements (the limiting correlation matrix should also 
be non-singular) and if a generalized Lindeberg condition holds, normality in the 
limit will follow. Somewhat more restrictive conditions which are simpler to 
state and which will be satisfied in most statistical applications are that o < c' < 
6/ < c" for all j, where & and c" are positive constants. Since the variance of 
Xij is (n — ly^nhi , it can be seen that the above inequalities imply the fulfillment 
of the conditions of the Laplace-Liapounoff theorem (see, for example, Uspensky 
[9], page 318). By [6.7] the correlation matrix is non-singular. 

7. Hotelling’s generalized T for permutation of the observations. In this 
section we shall restrict ourselves to bivariate populations, the extension to more 
than two variables being straightforward. Let (uu , 'W 21 ), * * *, (uim , Uim) be m 
pairs of observations on the chance variables Ui, 1 / 2 , and (ui(m4-i), ii 2 (m 4 -i)), * • *, 
(uiif, v^ff)y be n pairs of observations on the chance variables Ui, Uj, where 
m + n = N. If each of the pairs Ui, I/ 2 , and Ui , U 2 is jointly normaUy 
distributed with the same convariance matrix, the Hotelling generalized f for 
testing the null hypothesis that 

(7.1) Em = E{U[) 
and 

(7.2) ‘ Em = EiUi), 

is given (Hotelling [10]) by 

2 2 

r* = N~\mn) 2 2 ?«(«< -^<)(% — “#) 

y-i i-i 

where 

m • N 

= 53 = 53 

{-1 J-m+l 

and the matrix || g,y || is the inverse of the matrix || b,-,-1| with bn given by 

m N 

{N — 2)bij = 23 (Mh *“ ^<)(WiJ •“ %) + 53 (Mh ““ ^<)(Wii ^/)* 

In Hotelling's procedure the bif are sample estimates of the population co- 
variances whose distribution is independent of that of the sample means. A 
constant multiple of the statistic has the analysis of variance distribution 
under the null hypothesis. If the population covariances were known and used 
in place of the bn , would have the distribution with two degrees of freedom. 
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Let TIB now apply the generalized T over the permutations of the actuidly 
<^l>eerved values, as was done with other statistics in previous sections. If we 
do this literally we will find that the bn are no longer independent of the sample 
means. To avoid this complication we shall use a slightly different statistic T' 
which, as will be shown later, is a monotonic function of T, so that the test based 
on T' is identical with that based on T. The statistic 7*' is defined as follows: 
Let 


ft-il 

e'n =» N[(,N — 2 («<* “ — 17,) {i,j, ■■ 1 , 2 ) 

h-l 

and 

\\Qii\\’-\\cn\r\ 

Then 

(7.3) - «J)(«/ - flJ). 

The expression T'* is much simpler than T‘, since the coefficients gj/ are 
constants in the population of permutations of the observations. We shall 
show that T'* is a monotonic function of T*. Let 


Qn “ 2 («« - «<)(«/* - %) + 2 («<* - «<)(«/* - ^1-) 


*-1 

N 


Q'n = 23 ~ ^7<)(u,» — Uj) 

hml 


Then the expressiozu 

(7.4) 

and 

(7.6) 


IIQ*'II» llQwir 


r? = Z Z - aJ) 

<»-l j—l • 


Tie'Ll. e"'(fi< - «<)(«/ - «;>. 

<.i ,-l 


are constant multiples of T* ahd T'*, respectively. Hence it is sufiicient to show 
that Ti is a monotonic function of Tl. We have 

(7.6) Qn = Qn + m(fi, - t7,)(fl, - ?7y) + n(«; - r7,)« - 17,). 


Furthermore, we have 

/t iT\ .1 n — 

(7.7) 


njHi - fl,) 
m + n 



m 


Similarly 

(7.8) 


ramniTATioNs or tkb oBa>isvi|.Tioii£| 


Ht - Ui «= «<' - - g<) 

m + n m + n ' 


From (7.6), (7.7) and (7.8) it follows that 


(7.9) 


Qii 




mn , 

Denote —— by X and w< — Hi by hi . Then we have 

fn •f’ n 

(7.10) Qii » Qii + \hihi . 

Denote the cofactor of Qa in |j Qa || by Ra and the cofactor of Q'a in || Q'a || by 
Bii . Then 

(7 ID I Qtj I _ - I I __ I Qti\ __ 1 

IQol \Qn + ^ihi\ \Qii\ + \S2Riihihi i + xrf* 
Furthermore, we have 

/>t io\ IQ«I _ \Q'u ~ ^hihj\ _ |Q«| — XS^Rijhihj 

1-^-I^;i-ieSIl-* 

From (7.11) and (7.12) it follows that TI is a monotonic function of Tl . Hence 
also is a monotonic function of and, therefore, we do not change our test 
procedure by using T'^ instead of T*. 

Let the sequence of pairs 

(xii. Xii)y • • • , (xiif , xts) 

be a permutation of the actually observed pairs 

(iin , Wai), • • • , (uiN , Uzir) 

where to each permutation is ascribed the same probability (iV!)’'\ Then one 
obtains for i = 1, 2, 

(7.13) EiXi - Xi) = 0 

(7.14) <r\£i - ii) = N[iN - l)mn]“‘£ («« - ?7,)* * Cu 

3-1 

(7.16) E{xi - iim - = NUN - l)mnl-‘ f) («„ - ri)(u« - - c(,. 

1-1 

Hence || || is the covariance matrix of the variates 

(fi - *i') and («, - «i). 
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Now we shall show that the limiting distribution of T'*, as AT is the x* 
distribution with 2 degrees of freedom, provided that the observation Uik 
(i « 1, 2; ft = 1, • • • , AT) satisfy some slight restrictions. Since j( 1[ is the 
inverse of the covariance matrix |1 1| our statement about the limiting distri¬ 

bution of is obviously proved if we can show that £i — and X 2 — 3c2 have 
a joint normal distribution in the limit. 

Let iV' 00 while m/n remains constant. Let the sequences and Dk of 
Section II be defined as follows: 

There are two sequences Ax , denoted respectively by Aix and A^rf , such that 


Oij = Uij 

(i = l,2;j =!,••• 

,N). 

di = l^ 

O' = 1, • • • , m) 




II 

1 

1 

0 = m + 1, • • • 

,N). 


Then the sequences Dx satisfy the condition W. If also the sequences Aix 
satisfy the condition W, the distribution of Xi — x'i approaches the normal 
distribution as N increases, by the theorem of Section 2. If the joint distribu¬ 
tion of Xi -- Xi and ^ — X 2 approaches a normal distribution with non-singular 
correlation matrix, the distribution of approaches that of with two degrees 
of freedom. 

The correlation matrix of (xi — x[) and (x 2 — xz) will be of rank two in the 
limit, if the correlation coefficient between. {xi — xi) and (xj — xj) approaches a 
limit p, where ( p | < 1. By (7.14) and (7.15) this is equivalent to saying that 
the absolute value of the angle between the vectors A\x and A^x is eventually 
greater than a positive lower bound. We shall show that, if the correlation 
coefficient approaches, as iV —> , a limit p whose absolute value is less than one, 
and if Aix and A^x satisfy the condition W, then — xj) and (x 2 xi) are 
jointly normally distributed in the limit. 

Let 5i and ^2 be any two real numbers not both zero. Then the sequence 

Ax = (ai, • • • , Ox) 

where 

a* = diOij + 6202? 

will be shown to satisfy the condition W. If either di or ^2 is zero this is trivial; 
assume therefore that neither is zero. Without loss of generality we may assume 

N 

that 53 ~ 0, for if this were not so we could replace the original o,-, by a'a = 
Oii — S Oii as was done in Section 2, Let p' be such that 1 > p' > | p |. 
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For iV sufficiently large we have 

^ £ ®iy "" 2 I 5182 ]C ^lyOay 1 + ^2 ®a/) 

/ / 

- ^ ^(^1 £ Oiy •“ 2p' I 6182 1 V(2 ®?y) ]C ®ay) 

y y y y 

= jv-‘[(i «i I \/E^ - i «21 )* 

y y 

+ 2(1 — pO I 81821 ®*y) ®iy)] 

and 

M2(-4j) < 2(8 iM2(-4ijv) + 82iLl2(-42J^r)). 

Hence 

(7.16) ii 2 (Ajf) = fl[max {p2(-4ijv), M 2 (yi 2 y)}]. 

Also fjLriAt) is a sum of constant multiples of terms of the type 

AT^ E ainlT*. 

i 

By Schwarz’ inequality 

(7.17) N~^ Z oiyaJT' < = (wy(v4ii.)yij(.wo(Aw))‘.. 

y y y 

The required result follows from (7.16) and (7.17). 

Since the sequences A* satisfy the condition TT, the limiting distribution of 

8 i(xi - xl) + 82(^2 - ^2), 

for any pair 81,82 not both zero, is normal. From this and a theorem of Cramer 
and Wold ([11] Theorem 1 ; see also [ 8 ], Theorem 31) it follows that if the joint 
distribution of (xi — x[) and {xi — £ 2 ) approaches a limit, this limit must be the 
normal distribution. From a theorem of Radon ([12]; see also Cram 6 r [ 8 ], page 
101 ) it follows that if the joint distribution of {xi — x[) and (x 2 — xz) does not ap¬ 
proach a limit as AT 00 it is possible to find two subsequences of the sequence 
( 1 , 2 , • • • , AT, • • • ad inf.) for each of which the joint distribution approaches a 
different limit. This contradicts the previous result. Hence the limit exists 
and is the normal distribution. This proves our statement that the limiting 
distribution of is the x distribution with two degrees of freedom. 

The statistic seems to be appropriate for testing the null hypothesis that 
two bivariate distributions Hi and 1 X 2 are identical if the alternatives are re¬ 
stricted to the case where 112 differs from Hi only in the mean values, i.e., the 
distribution 1 X 2 can be obtained from XXi by a translation. This is no restriction 
as compared with Hotelling’s T-test since also the T-test is based on the assump¬ 
tion that the two normal populations differ at most in their mean values, i.e,, 
the covariance matrices in the two populations are assumed to be equal. 
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ERROR CONTROL IN MATRIX CALCULATION 

F. E. Satterthwaitb 
Aetna Life Insurance Company 

1. Introduction. The solutions of large sets of simultaneous equations and the 
inversion of matrices are often complicated by the fact that errors, such as those 
introduced by rounding, become magnified in the course of the calculations to 
such an extent that the results are useless. In this paper we shall show that if 
the norm of the matrix A — / is less than 0.35, operations involving the inversion 
A or the multiplication by will be in a state of error control for “Doolittle” 
methods of calculation. Thus such calculations may be carried through with 
assurance that the errors in the results will be limited to two or three significant 
figures. We also point out that as soon as an approximation to A"”^ is available, 
most problems may be restated to bring them within the requirements for error 
control. Therefore the solution can be immediately completed to the desired 
degree of accuracy in one step instead of requiring multiple steps as do the 
iterative methods. 

2. The inversion of special matrices. Consider the problem of inverting the 
matrix (I + F) where I is the identity matrix and (I + F) is a non-singular sauare 
matrix. I^et 

(2.1) G = (/ + F)-‘. 

Then 

(2.2) (/ + F)0 = I 
or 

(2.3) G = I - FG. 

In ordinary algebra this would not be a practical formula for the calculation of (?. 
However in matrix algebra the situation may be different. Examine the ex¬ 
panded form of G: 

(2.4) gij = ^fikOkj^ 

The summation is over all values of k from 1 to n. Next examine the affect of 
imposing certain restrictions on F. For example, let * 0 if j > i. This is 
equivalent to making the summation in (2.4) over the range 1 to ^ — 1. The 
first row of (2.4) then becomes 

Qa = Bii 

and no appear on the right. For the second row 

gii = ^2} — fiigu • 

373 
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The only g^B on the right are those on the first row which have already been 
calculated. For the third row 

Qzi = 5a/ /aififi/ — . 

The only on the right are in the first and second rows and have already been 
calculated. Similarly for the fourth and later rows. 

Thus it is seen that if F is a ^^pre-diagonaP^ matrix, (2.3) is a very simple and 
practical formula for the numerical calculation of the inverse of (/ + F). Also 
if F is a post-diagonal matrix, (2.3) may be used by working up from the bottom 
row. 

Similarly, if a matrix H is to be multiplied by the inverse of (J •+• F), let 

(2.5) (?=(/ + FY^H 
and the working equation becomes 

(2.6) G H - FG. 

The inversion of a diagonal matrix is accomplished by inverting each of its 
diagonal elements. That is if 

(2.7) F = IIM//II 

then 

(2.8) F-^ = II ||. 

3. The inversion of general matrices. The general inversion problem will be 
solved if a general matrix can be factored into matrices of the special types 
treated in the last section. For the moment assume that such a factorization is 
possible and let the factors of the general matrix, A, be 

(3.1) A = (Fi + I)Si{I + Ti) 

where Ri is a prediagonal matrix, Si a diagonal matrix, and Ti a postdiagonal 
matrix. Then 

(3.2) A = aSi “b RiSi + SiTi + RiSiTi . 

A slight change in form now appears desirable so let 

(3.3) A = {RS-^ + I)S(I + S-^T) 

=- R+ S+ T + RS~'SSr^T 
= R + S+T + RS~^T. 


(3.4) 

For convenience let 
(3.6) 


B ^ R-^ S+T 
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and remember that B, S, and T have no non-zero elements in common, Th»:ie' 
fore the non-zero elements of R, S, and T are equal to the corresponding elements 
of B. Bearranging (3.4) gives 

(3.6) B = A - RS~^T 
and the elements of B are determined by 

(3.7) = 

Skk 

Since r,* = 0 for fc > i, there is no point in making the summation beyond 
fc = i — L Also since tkj — 0 for fc > j, there is no point in making the summa¬ 
tion beyond i — 1. Therefore the summation in (3.7) is to be considered to be 
over the range 1 to the smaller of i — 1 and j — 1. The r's, s’s and on the 
right of (3.7) can now be replaced by the corresponding Vs: 

(3.8) 6,, = ao - 

Since the first row (column) of Vs equal the first row (column) of a% the second 
row (column) of Vs is a function of only those Vs in the first row and the first 
column, etc., any calculation routine which works down from the top and from 
the left to the right will lead to a ready determination of all the 6^s by (3.8). 

Thus we see that the assumed factorization (3.3) of A is always possible (unless 
some of the diagonal elements, bkk , of B are zero) and moreover the elements of 
the factors are readily calculated by the simple equations, (3.8). 

Therefore, to invert the general non-singular square matrix A, calculate the 
elements of an intermediate matrix B^R +S + T hy equations (3.5) and (3.7). 
Then from (3.3) we have 

(3.9) = (/ + s-\i + RS~Y^ 

which can be readily calculated by the methods of (2.3) and (2.6). 

4. The Doolittle method. The Doolittle method of matrix calculation can 
now be expressed in terms of the matrices 72, S, and T studied above. To 
illustrate we shall use the set of equations: 

(4.1) aiiXi + + auXs = CuVi + Ci2t/2 + == di , 

0213^1 + 022 X 2 + 023 X 3 = C 21 I /1 + C 222/2 + C 232/3 = di , 

OsiXi + 032 X 2 + 033 X 3 = Cziyi + 032^2 + CzaVi = di . 

This set of equations will be represented in the form of a three element matrix, 
(4.1.a) ||AX:CF:Z)|1. 
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The essential feature of the Doolittle method of solution is that we replace (4.1) 
by an equivalent set of equations for which the prediagonal coefficients of the 
X's are all zero and the diagonal coefficients are all unity. Therefore consider 
the set formed as follows: 

(4.2) II A,X : C.r : D, II = -r‘{II AX:CY:D\\-R\\ AtY : CtY : D, ||}. 
Then 

(4.3) At = - filA,) 

or 

(4.4) iS + R)A, = A 
or 

(4.6) At = {S + 7J)-‘(B-Sr‘ + 7)5(7 + S-^T) by (3.3) 

= (7 + S-^T). 

Since S~^T is a postdiagonal matrix, || AtX : CtY : 7)$ || are the required inter¬ 
mediate equations for a Doolittle type of solution. 

The final solution is now easily obtained. Consider 

(4.6) li AtX ; CtY : II = || AtX : CtY : A 1| - {S-^T) || AtX : CtY : Dt ||. 
We have 

(4.7) ^4 - ^3 - 

or 

(4.8) (7 + S-^T)At = At 

^I + S-^T by ( 4 . 5 ). 

Therefore A 4 is in fact the identity matrix and (4.6) can be rewritten 

(4.9) II X : CtY : Dt || = \\AtX: CtY : D* 1| - (S'^T) |1 X : CtY : Dt || 

6 . The non-symmetric case. In actual practice, the work has to be so 
arranged that the elements of the matrices R, S, and {S~^T) are set out so as to 
be readily available for use as multipliers in forming the intermediate and final 
sets of equations. Table I ^ves such a practical layout for the non-symmetric 
case. 

The elements of (fiT^T) are set out as the postdiagonal elements of Az so that 
they do not need further attention. To determine the elements of R and S, 
we form a set of pre-intermediate equations: 

.. : ...|| = ||AZ: ... : ... \\-R\\AtX - X i ■■■ : ■■■ ||. 


(5.1) \\BX:- 






Layout of DoolitUe solution for the non-symmetric case. Coefficients not used further are indicated by • 

OnXi + + UitXt — C\iy\ + + Cnyi =“ rfi 

OiiXi + OnXt + auXt =“ Cjiyi + Cj*yt + Cnyt == dt 

OnXi + aiiXt + attXi = Cnj/i + Cnyt + CtaVt = 
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H Hu 
1^11 


I ‘ I » 


I ‘ I > 

•o'a "o -T? :a 

H H u H n tl 


•o -tj 
II H H 


'rs 'T?-o •o 'ts •o 

II II n II II n nun 


+ + + + + 


+ + + +++ 


't) 

II n H 


o 

« 'S 


+ + + 




9 .g ;3> ;»» ;a> 

S !3 8 

o » w « w 

I ® + + + 


^ S ^ 

cs o 
+ + + 


51 ^ “ 


5 5 « 
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Hieii 

B = ^ - R[At - I] 

= A - mi + S-^T) - I) by (4.5), 

= A'- RS~^T 

(6.2) = R + S+T. by (3.6) and (3.6). 

Therefore we see that the prediagonal coefficients of the x^& in (5.1) are the 
elements of R and that the diagonal coefficients are the elements of S. The rest 
of the coefficients in this set of equations are not needed in the calculations and 
have been indicated by dots in Table I. 

6. The symmetric case. If the A matrix is symmetric, advantage can be 
taken of the fact that the B matrix is also symmetric. Therefore 

(6.1) S+T ^ S + R' 

and the elements of S and R' can be written down just before the division by 8u 
in the calculation of the Az matrix: 

(6.2) A, = (/ + S-^T) = S-\S + T) by (4.5) 

= S-\S + R'). 

The layout of the work is given in Table II. 

If the C 4 matrix is symmetric, the prediagonal elements of C 4 can^be entered 
by symmetry. Therefore it is not only unnecessary to calculate the prediagonal 
elements of C 4 , but we can also omit the prediagonal elements of Cz . Note 
that in this case C 4 must be calculated from the right to the left as well as from 
the bottom up. 

The most important case where it is known in advance that C 4 is symmetric is 
the determination of the inverse of a symmetric matrix. Then C = / and C 4 
= Also the postdiagonal elements of Cz are all zero so that the only 

elements of Cz which have to be calculated are the diagonal elements. These 
are the reciprocals of the s^'s. 

A case where Ca is symmetric though C I will appear in a subsequent paper. 

7. Norms. In order to state the conditions for error control in a matrix 
calculation, a concept of the norm or the absolute value of a matrix is necessary. 
In this paper the norm will be defined as the square root of the sum of the squares 
of the elements of the matrix. That is 

(7.1) N{F) = 

The two basic inequalities satisfied by the norm are 

(7.2) N{F + G) < NiF) + N(0) 
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and 


(7.3) N(FG) < J\r(F)J^^(G). 

All other properties of the norm are derived from these. 
For future reference we list the following norm relations: 


(7.4) iV[(F)(J + (?)] = N{F + FG} 

< NiF) + N(F)N{G) 

< mF)[l + N{G)]. 


li N(F) < 1 we have 

(7.6) N[(I + F)-* - J] = iV[/ - F + F* -- I] 

< N(F) + [JV(F)]“ + [iyr(F)]» + • • • 
N(F) 

- 1 - N{F) ‘ 

If NiG - J) < 1, (7.5) becomes 


(7.6) 


1 + iV((r‘ - I) < 


1 

1- N{G- I) ‘ 


When N(F - I) < 1 

(7.7) NiF-^G) < N\I + - /)][G] 

< [1 + Ar(F-* - r)]N(G) 

^ NiG) 

- 1 - W(F - 7) 


S79 


by (7.6) 


8. Error matrix and error norm. We shall also need a formal statement as to 
what we mean by error and we need a measure of the errors. 

By an error matrix we mean the matrix whose elements consist of the differ¬ 
ences between the value of the matrix elements as actually calculated and the 
true value of the matrix elements which would have been obtained if all calcula¬ 
tions had been made exactly without any rounding or other approximations. 
The fundamental relation for the error matrix, F[/(G)], of a function, f(G), of G 
is 

(8.1) F[/(G)] = f[G + E{G)] - m. 

If each element of a matrix is calculated to q decimal places and the matrix 
has p rows and p columns, the maximum rounding error introduced in any 
element is .5 X 10~®. The norm of the error introduced by rounding is less than 


( 8 . 2 ) 


NEi = Vp*[.5 X IM® 
= .5 X 10”* X p. 




380 


r. B. SATTBBTHWATTB 


For triangular matrioes 
(8^) NEt * , 

Fw one oolunm matrices 
(8.4) NEt 


5 X IO'VpCp + l)/2. 


5 X 10-Vp. 

For future reference the following formulas for error norms are listed: 

(8.5) NE(F + (?) < NE(F) + NE(G). 

(8.6) NE(FO) < NE(F)NIG + E(6)] + NE(G)N(F) 

< NE{F)N{G) + NEiG)N{F) + NE{F)NEiG). 

(8.7) NE[F{I + (?)] < NE{F)[1 + N[G + i?((?)] + iVB((?)JV(F) 

< NEiF) + NEiF)NiG) + NEiG)N(F) 

+ NE(F)NEiG), 

liN{F - I) + NEiF) < 1, 

(8.8) NEiF-^) = NE[I + (F - T)]"* 

1_1_1 

+ [F + F(F) - /] / + (F - /)J 

N{I + [F+ EiF) - + (F - J)r‘{F(F)} 


= ^[r 


NEiF) 


^ {1 - NIF + EiF) - /]) {1 - NiF - /)} 
. NEiF) 


by (7.7), 


- [1 - N iF 

If NiF - I) + NEiF) < 1, 


1) - NEiF)][l - NiF - /)]• 


(8.9) NEiF-^G) = [j ^ 


G 




+ IF + F(F) - I] Z + (F 

[F(G)][I+(F-Z)] - [F(F)][(?] "I 
{Z + [F+F(F)-Z]j{Z+(F-Z)|J 

NEiG) + {[i\rF(F)]/[l - NiF - Z)]}{JV((?)} 
1 - iVIF + EiF) - Z] 


by (7.7). 


9. Certain maxima. The R, S, and T matrices have no non-zero elements 
in common so that 


(9.1) 


NiB - I) = NiR + 8 - I + T) 

= V[NiR)]^ + ms - Z)]* -I- liV(r)p. 


Similarly 

(9.2) 


NEiB) = ViNEiR)]* + lNEiS)f + [NEiT)¥- 
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m 


The developments of l^e following formulas for certain maxima are not given 
here since they involve only well known calcultis principles. It is understood 
that these inequalities hold whenever the quantities involved exist. Usually the 
maxima are given subject to the condition NiT) — NiJR) as well as for the 
unrestricted case where N(,T) may be zero. 


(95) 


(9.4) max 


(9.6) 


N{R)N{T) , r. -FiffTB-KT. 

j_iV(s_/) ^ -01 • 


iV(S - /) 

JV(ie) _ NiB- I) 


1- N(,S- I) {1 - [NiB - /)]»} 


lit 


if NiT) = 0, 


V2vT-whTf “ 


(9.6) max lNiR)NEiT) + NiT)NEiR)] = ^[^^(JB - /)]* - [NiS - /)]* 

X y/lNEiB)]^ - lNEiS)Y. 

Any substitution satisfying the relation 


(9.7) 


NEiR) __ NiT) 
NEiT) NiR) 


will cause (9.6) to attain its majximum. 

(9.8) Ta&x[NEiR) + kNEiS)] = Vh+l^NEiB) if NEiR) = NEiT), 
(9.8.a) = Vin^ArJS?(B) a NEiT) ^0. 


(9.9) 


max 


k + NiR) 


fc + 


[NiB - D? 


I- NiS- I) " ■ K 

where K is the root of 

(9.10) II + k^]K^ + 2[k{NiB - + {[NiB - I)]* - [NiB - 7)1*} = 0. 


10. Errors in the Doolittle method. In all that follows, we shall assume that 
NiA — I) is small enough so that the “divisions” are permissible. First let us 
examine in the multipliers B = R + S + T. By (3.6) 

(10.1) B-I = iA- I) - RS^^T. 


Therefore 


NiB- I) ^ NiA - 


^ , NiR)NiD 

i-ms- 1 ) 


by (75) and (7.7), 
by (9.3) 


(105) 


^NiA-D + l-Vl- [NiB - /)]* 
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Bemembering that A has no error and letting be the rounding error norm 
introduced in writing down the elements, we have 


/inoN »7P/x,^ ^ *7*. . mm{T) - NE{T)N(R) + NE{B)NE(.T) 
(io« mm<NB, + - 1 - - j] - 


NE(S)N(T)N(R) 

{1 - N[S + EiS) -/]}{!- N{S 


by (8.9) and (8.6). 


We are interrated only in the range of values where the errors are small. There 
fore we shall ignore second order errors. Except for such errors, 


NE(B) < NEi + 


A7B-7D\ 1 \TVfWS 

1-ms-I) + 1-ms-1) 


[1 - NiS - 7)p 


NE(S). 


The last term will be largest when N(T) = N(R). Therefore the sum of the 
second and third will be largest when NE{R) = NE(,T) by (9.7). 

(10.5) NE(B) <NE^ + 2 — NE{R) + [jpJ|^]VE(S). 
By (9.8) we now obtain 

(10.6) NE(B) < NEi + KV2T^ NE^B) 

NEi 

- 1 - KV2 + Kl 

where 

N(.R) _ - 1) ^ 

^ ““ 1 - NiS - I) a/2 Vi - [N{B - I)f 

Actually in practice we introduce the rounding error in instead of in T 

as assumed above. Our assumption is conservative since the division by S 
magnifies any error in T. 

Next consider the errors in the C» (or Z)») matrix. From (4.2) 

(10.7) C, = S-\C - RCt) 

= fiT'C - S-^RCt 

= (/ + S-^Rr^E^^C by (2.5) and (2.6), 

= m + iSr‘i2)r'C 

* (5 + R)~"C. 


( 10 . 8 ) 
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mc) 


1- N{R + S-D 


by (7.7); 


Therefore 

(10.9) N{C») ^ 

- 1 - N(B) -N(S- D • 

From (10.7), remembering that C has no error, and letting the rounding error be 
NEi, 


mc) 


(10.11) NEiCt) < NEt + 


NEiR)NiC») + NE(Ct)N[R + F(/2)] 


1 - N[S + E(S) - 7] 

NE{S)[N(C) + JV(/2)iV,(C,)] 

{1 - N[S + E(S) - 71}{1 - N{S - I)] 


by (7.7) and (7.3), 




1 - N[R + E(R)] - N[8 + E{S) - 7] 


smce 


( 10 . 12 ) 


N(0 + NiR)N{C,) 
1 - N{S - ly 

^ N(0 

I ~~N(S 


=T)[' + 


N(,R) 


1 - Nm - ms 
mc)[i - m- 1 )] 


-7)] 


by (10.10), 


- [1 - ms - 7)][i - mR) - Nis - 7)1 

and since transferring the terms in NE(Ct) to the left requires that we divide 
through by 

, N[R + E(R)] __l-N[R + Em-NlS+E{S)-I\ 

1 1 _ N[S + EiS) + 7] ~ I - NIS + E{S) - I] 

Again we can ignore second order errors. Taking maxima by (9.8.a) gives 

V2 NE(B)mCl 

(10.14) NEiC,) < ' 


I -V2N{B- I) 


+ NE» 


1 - V2 NiB - r\ 


and 

,.. NEiC,) . V2NEiB) [iV^j/[Af((7)] 

UU.lo; - [1 - V2Af(B - 7)1* 1 - \/2mB - I)' 

Thus we see that the proportionate error in Ct is made up of two parts: the first 
due to the roimding errors in the multipliers as given by the first term and the 
second due to the proportionate rounding error introduced in calculating Ct. 
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S&i 


Finally we have the errors in the C 4 (or Di) matrix. Since 
(10.16) Ct => A-^C 

we have 

00.17) 

By (4.6) 

(10.18) C*-C,-(S-‘2’)C'4. 

If we let NEt be the rounding error introduced in this step 


00.10) ma s me.) + m + 

NEiS)N{T)N(Ci) 

{1 - N[S + EiS) - /]} {; - N(8 - I)] 
{NE(C») + NE,}{1 - M-S + E(S)~ I]} 

+ rArF(r) + -™^ir_-I 

l- N[T + E{T)] - N[S + E{S) - I] 

by (10.17) and relations similar to (10.13). We now ignore second order errors 
and take maxima by (9.4) and (9.8.a): 


NE{Ci) (NE^ Vl + K^ NE{B) 

(10,20) < "(o_ „ .y.(g ^ , i ,=„^ K -jL 

^ 17(0 “ 1 - V 2 17(B - n 


[[1 -NiA -7)JVl-[1 - V2 W(F-/)]J 

[NEt]mO\ NE, 
l-\/2W(B-/) JV(0 
l-\/2N{B- D 


where 


( 10 . 21 ) 

and 


K = max 


NiT) 

1- N{S- I) 


NjB - I) 

Vl - ms - ijp 


by (9.4), 


(10.22) Vl + X* = 1/Vl - IJV(B - I)]*. 

if A is symmetric, NE(B — I) remains imchanged but NEiO) can be some¬ 
what strengthened through the use of (9.5) and (9.8) instead of (9.4) and ( 9 . 841 ). 
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(10.23) 


< f __ 1 _ 

TO L(1 - N(A - D]y/2Vl -~mB - /)]* 

+ ' ~|r NEjB) 

II - V3/2 N{B - /)]*JLl - VW2 N(B - 1) 
j. _1 NE* 

■^(1-V8/2Ar(B-I)]*TO 

_ 1 NE* 

1 - VmNiB - T) N{C)' 


If C is approximately equal to J, a better formula is obtained by substituting 
(Z + Cl) for C. The &ml fojraulas then are identical to (10.20) and (10.23) with 
the substitution of 1 + iV(Ci) for A^(C'). Similarly if C = / so that C« « A“‘, 
1V(C) = 1 should be substituted in (10.20) and (10.23). 

If A is symmetric and if C = Z so that Ci = A"*, the prediagonal elements of 
Cl are filled in by symmetry as in Table II instead of being calculated directly. 
This complicates the analysis of error relations. The following inequality gives 
the error limit for the diagonal and postdiagonal elements of (A“*). We have 
indicated these elements by F. 


(10.24) NE(F) < ~ NE(B) 

[1 - Ar(A - Z)) [1 - V 2 N{B -I)] 

NEi 

1 ^ y/2mB - I) 

where K is the root of (9.10) when fc = 1 — N{A — I) and NEi = 0.6 X 10”® X 
V p(p + l)/2 by (8.3). 


11. Results. Given the matrix A, we subtract one from each element on the 
principal diagonal to obtain the matrix (A — J). By the norm of (A — I) we 
mean the square root of the sum of the squares of the elements of (A — I), We 
shall now show that a Doolittle process such as outlined in Table I is in a state of 
error control if the norm of (A — /) is less than 0.35: 

1. N(A - /) < 0.36 

2. N(B - /) < 0.4642 by (10.2). 

3. NE(B) < 1.09 p by (10.6) and (8.2) 

if the maximum rounding error in any element is 0.5 and A has p rows. Thus no 
element of the multiplying matrices R, S, or T can have an error of greater than 
this amount. 

4. NE(C 4 ) < (44 X p X 10”®)Ar(C) + (6 X p X 10“") by (10.20) and (8.2) 

where q decimal places are carried on the left and r on the right. Thus if the 

decimal point in C is shifted so that jV(C) < 1, the error in any element of C 4 can 
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not amount to more than three significant figures if the same number of decinud 
places are carried on both the left and on the right (four significant figures for 21 
to 200 rows). 

5. NE{A~^) < three significant figures by substituting N{C) =* 1 since C = I. 
As we let N{A — I) become larger than 0.36, the maximum errors indicated 
by our formulas rapidly become very large. In fact they become infinite if 
N{A - I) = 0.414. 

Since for more than four equations the above formulas show errors in the 
second decimal place no matter how small N{A — I) is, it is suggested that as a 
general practice: 

1. The problem be arranged so that N{A — I) < 0.36. 

2. The decimal points in O' be shifted bo N{C) < 1. 

3. Three extra decimal places be carried in the calculations. 

• 

12. Preliminaiy adjustments. The requirement that N{A — I) should be 
less than 0.35 is not normally met in practical problems. If, however, an 
approximation to is available, the problem can almost always be rearranged 
to satisfy this condition. 

Thus if we are solving the equations such as given in Table I, 

(12.1) AX = cr = D, 

we are perfectly free to multiply through by any non-singular matrix F without 
disturbing the solution: 

(12.2) {FA)X = {FC)Y = (FD). 

Now if F is a sufficiently close approximation to A'“\ FA will be almost equal to /. 
Therefore iV(FA — I) will be less than 0.35 and a Doolittle solution of (12.2) will 
be in a state of error control. 

Similarly for the inversion of A, we can apply the Doolittle process to the pair 
of matrices 

FA : F 


just as easily as to the pair 

A : I 


since 

(FA)"' II FA : F II = A"'F"' || FA : F || 

= II A"'(r-'F)A : A"'(F"'F) || 

= llA-'A:A-'|i 
= 1|/:A"'||. 

Thus by taking F as a sufficiently close approximation to A"', we can bring an 
inversion calculation into a state of error control. 
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The computor should be cautioned that the multiplication by F must be exact 
and that no rounding is idlowable in this step. Our formulas aasoined that we 
started with matrices free of error. 

13. Further work. The principles used in this paper can be applied to the 
task of developing calculation routines which will be in a state of error control 
regardless of the size oiN{A — I). Enough work has been done to see that sudi 
routines do exist and do not involve prohibitive labor. The author expects that 
the most efficient routine will be to use these more elaborate methods to obtain 
an F such that NiFA — I) < 0.35 and then to use the normal Doolittle met&ods 
as outlined in section 12. 
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mWERSE TABLES OF PROBABILITIES OF ERRORS OF THE 

SECOND KIND 

Bt Emma Lehmsb 
University of California^ Berkeley 

1. Introduction. The problem of testing linear hypotheses was discussed by 
Kolodziejczyk [ 1 ], and later in greater detail by Tang [ 2 ], who computed a table 
giving the probabilities of errors of the second kind Pn for a range of values of 
the two degrees of freedom fi and / 2 , (/i = l(l) 8,/2 = 2, 4, 6(1)30, 60, <»)^ and 
for two fixed levels Pi = .01 and .05 of the probability of errors of the first kind. 
These tables are in terms of a parameter = 1(.5)3(1)8) whose statistical 
significance, or rather that of 

X = (fi + l)*p^/2 

is discussed in Tangos paper. A restatement of the problem of testing linear 
hypotheses in a more canonical form, giving an interpretation of X, will also be 
found in a recent paper by Wald [3]. 

Professor Neyman has felt for some time that a table giving <p = v?(a, i>, a, fi) 
as a function of the two degrees of freedom fi = 2 a, and = 26, and of the two 
probability levels a == Pi and ^3 = 1 — Pn would be more useful for statistical 
purposes, where fi is the probability of detecting the falsehood of the hypothesis 
tested. A paper by Professor Neyman explaining this point of view and giving 
applications of the present tables to some statistical problems will appear 
shortly. These tables were computed in the Statistical Laboratory of the Uni¬ 
versity of California,* and give values of ^for the following range of parameters; 

(a, fi) = (.01, .7), (.01, .8), (.05, .7), (.05, .8) 

/i = 1(1)10, 12, 15, 20, 24, 30, 40, 60, 80, 120, oo. 

/2 = 2(2)20, 24, 30, 40, 60, 80, 120, 240, oo. 

2. Analytic definitions. The statistical parameter 

(1) X = X(a, b, a, /3) = (a + i) v{a, b, a, fi) 

can be thought of as an inverse function connected with the hypergeometric 
distribution. Inverse functions y{a), u{a, a) and a?(a, b, a) of the better known 
normal. Gamma and Beta distributions respectively have all been tabulated, 

^ The notation m ■■ r(8)t is equivalent to m ■= r, r + «, r 2«, ... , ^ 

* These tables were begun by Miss Leone Gintzler, and were carried on by Mark Eudy 
under a University of California Research Grant. The bulk of the computing was done, 
however, by the author and by Mrs. Julia Robinson under a grant of the American Philo¬ 
sophical Society. 
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and are sometimes called “percentage points'’ of the distribution. To b^;in 
with the simplest, tlie normal distribution, we may define y(a) as the solution of 



This function has been recently tabulated by Truman Lee Kelley [4] for a »■ 
p O(.OOOl) 1 to 8 decimal places. 

The function u(a, a) is the solution of 

(3) I <*■*«■'* *= r«(a)/r(a) =* «. 

This is connected with the well known percentage points of the xr distribution 
with V = 2a degrees of freedom as follows: 

(4) xJ(«) = 2M(a, 1 — a). 

Catherine Thomson [5] has tabulated x5(a) for a = .005, .01, .025, .05, .1, 
.25, .5, .75, .9, .95, .975, .99, .995, and for r = 1(1)30(10)100. She has also 
tabulated [6] the corresponding parameter x = x(a, b, a) of the Beta distribution 
with vi = 25, Vi = 2o degrees of freedom defined by 


(5) 


B(o, h) 


i 


a{a,b,a) 


r\i - t)^Ut = 


B,(o, b) 
B(a, b) 


a 


for a = .005, .01, .025, .05, .1, .25 and .5 and vi = 1(1)10, 12, 15, 20, 24, 30, 
40,60,120, Vi = 1(1)30,40,60,120, « to five significant places. 

Similarly X(o, b, a, B) can be defined as the solution of 

1 o) 


where 

F(t,«,s) = l + ]z+t|l^V + 

is the confluent hypeigeometric function. 


3. Limiting 

(7) 


cases. It is well knoAvn that as a tends to infinity 
X»(tt) _ u(o, 1 - tt) 1 I KM ^ 0 I 

«< ^ /n 


There are many approximations [7] to In a recent paper Peiser [8] gave a 
rigorous derivation of an asymptotic formula for 
Similarly, the limiting cases of x{a^ b,a)&aa and b tend to infinity are known 
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Level of Significance, a = .05. Probability of Detecting the Faleeb 
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Level of Significance, a = .05. Probability of Detecting the Falsehood of the Hypothesis Tested 
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[9] and follow readily from (5), although no attempt haa been made, as far as I 
know, to find better approximations. These limiting values are as follows: 

(8) lim a[l — x{a, b, a)] = «(6, 1 — a) 

(9) lim 6a:(o, 6,a) — «(o,a). 

6-*oo 

The two corresponding limiting cases for X are not at first glance so symmetric. 
When a tends to infinity, we have from (1) 


(10) lira - = V* 

a-*oo Ct 

while 

(11) Km Fi-b, a, -wph) = (1 + 

a-*00 

Substituting these in (6) and letting t = I — z/a{l + <p) and passing to the Kmit 
we get with the help of (8) 


( 12 ) 


1 

r(6) 


/: 




e~‘z*'~^dz =1-/3. 


In other words 


(1 = u(b,^) 


or 

(13) 1- 

This is the only case, except for 6 == 1 in which <p can be given explicitly. 
For 6 = 1, we have from (5) 

x(a, ly a) == a 

and (6) can be easily integrated to give 

via, 1, a, /3) = [^log - «)] • 

In all other cases it was found impractical to attempt an inversion of (6) to get <p. 
When b becomes infinite (6) becomes with t = z/b, and with the help of (9), 

-u(o,l~a) /-\(a-l)/2 ‘ _ 

(14) e-* e- j /*-i (2^X2) & = 1 - /3 

where 

/.-.(2VS) . 


( 16 ) 
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is the Bessel function of a purely imaginary argument, whi(^ is usu^ly d^ned by 


In{x) - 


£ 




(r 


vlrin + V + 1) • 


Expression (14) was also obtained for this limitiog case of the hypergeometric 
distribution by Wishart [10], This int^al, however, does not give X any more 
explicitly than the general integral ( 6 ), and since the calculation of X increases in 
difficulty as a increases, an attempt was made to derive an approximate formula 
for ^(a, 00 , a, /5) for large a. To this end an asymptotic formula [11] was de¬ 
veloped for /»(nx), the principal term of which is 


(16) 




(1 + / Vi + sfi — 1 

^/2lcn \ X 


Substituting this into (14) with x ^ 2 \^/a — 1 and n ^ a 1 we get for 
large a 


(17) 




0 


^a—Ig—I (1— l%a) 


d{ = 1 - d 


If we assume that <p is sufficiently small to neglect the term in <f>* we get as a 
first approximation 


(18) 

or 

(19) 


1 tt) 

= 7 -. / dt = 1- 

r(a) Jo 

If) 


a formula veiy similar to (11). In fact since a is large this formula can be 
reduced one more step using (7). This gives 

( 20 ) lim 'v/^*(o, ®, a, d) = J/(«) — y(d). 

o-*oo 


Similarly (13) becomes 

lim •\/av*('», b, a, d) = via) - yifi). 
6-+00 


If instead of neglecting the term in ip*, we multiply it by the value of t at its 
upper limit, we get 


( 21 ) 


« 1 — Vl — 2 [«(a, 1 — a) — u(o, 1 —' d)]/o 

ip -- ' . ■ ■. . ■ . —■ 

m ( o , 1 — a)/a 


Professor Neyman derived another approximation for <p{a, «>, a, ff) by assuming 
that the distribution (14) approaches a normal distribution for large a. He 
obtained: 


atp ~ y(«) 4- 


y/a 




( 22 ) 
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Both (21) and (22) obviously reduce to (20) in the limit. The following table 


shows the efficiency of these formulas for a 

; - 60 





Table 

(21) 

(22) 

(20) 

a = 

.01 

.687 

.695 

.668 

. .640 

0 - 

.8 





la = 

.01 

.647 

.642 

.622 

.607 

[0 = 

.7 





fa = 

.06 

.603 

.585 

.503 

.666 


.8 





Ja = 

.05 

.557 

.540 

.544 

.529 


.7 






A rigorous derivation of some such formula giving the actual order of approxima¬ 
tion of <p would of course be of interest, but is likely to be quite complicated. 


4. Calculation of tables. It is fairly obvious that the integral (6), although 
very useful theoretically is not well adapted to actual calculations. It can 
easily be integrated by parts to produce the infinite series. 


(23) 


«x ^ X* Bi-,(a 4 - iy h) _ -x X* Bx{by ci + i) ^ ^ 

h i\ B(a + iy b) h il B(a + i, b) 


This series can be used effectively for calculation purposes only if X is compara¬ 
tively small. If b is an integer, however, this series can be replaced by a finite 
series of b terms, which was also used by Tang [2] in calculating his tables. 
This series is as follows with x == x(a, &, 1 — a) = x(by a, a): 

(24) ETi = 1 - 0 

t-O 

where 


To =1, Ti = x[X(l — x) + a + & ““ 1]/(1 — x) 

and* 

(26) Tn = - x) + a + b — n] Tn-i + XxT„^ 2 }/n(l - x). 


The subjoined tables can be thought of as inverses of Tang's tables, and could 
have been obtained from tables such as Tang's by inverse interpolation, had the 
interval of tabulation been sufficiently fine. The interval of tabulation of .5 for 
<p allowed only a crude approximation or trial value of the corresponding 
probability was then calculated for this point, and then corrected with the help 


* It will be noticed in comparing these formulas with those given by Tang, that x is used 
for 1 — X. This is done to conform with Miss Thomson’s table for x. 
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8 ^ 

of derivutiveB. la the beginning of the work, a recalculation was usuidfy 
made for the corrected value of and the tabulated value of tp was then obtain^ 
by inverse interpolation between these very close values. As the work pro¬ 
gressed, and difference tables were calculated in several directions, the guesses 
improved considerably so that a correction could be made using the first deriva¬ 
tive to give a tabulated value of <p correct to three dedmal places. Such corrections 
never exceeded .004, and theref^ it is hoped that the tables are correct to the 
last place. The derivative in question is given by 

(26) g = (2o -1- DvxO. - , 

and was obtained as a by-product of the calculation of (24). This method was 
used for all finite a's and for all Vs less than 30. For 6 = 30 or more it was 
found more expeditious to use the infinite series (23), about 20 terms of which 
sufiSced. 

The values of x(a, 5, 1 — a) = a;(6, a, a) used in these calculations were o^ 
tained to five significant places from Miss Thomson's table [6] for a = .01 and 
.05 with vi = 2a, V 2 = 26. 

No calculations were made for non-integer 6's since for small values of 6, 
X was too large to make the infinite series (23) practicable, while for b ^ 7, <p can 
be easily obtained by interpolation. The only available method for calculating ^ 
for 6 = 1/2, 3/2 or 5/2 would be by numerical integration of (6), which would be 
rather lengthy. Furthermore, the interest seems to be in large rather than small 
values of a and 6. 

5. Calculations for infinite cases. The case a = co was readily disposed of 
using (13), as for 6 = oo, the integral (14) was again integrated by parts to give 

«o /.A(a+r)/2 

(27) g (^) I^,{2VuK) = 1 - 

This was found to be effective, especially when u <\, which is the case for small 
values of a. When u exceeded X, the complementary series was used, namely 

•0 AA(a—1—r)/2 

(28) g j /_i_.(2v^) = /9. 

The calculations proceeded in much the same manner as in the finite case. The 
values of 2u were obtained from Miss Thomson's table of percentage points of 
x5 (a) distribution with v = 2a degrees of freedom, while the values of lo and h 
were obtained from the tables of Bessel Functions [12] for 2 \/\z < 20, and from 
the tables of Anding [13] for larger values of the variable. The values of Ip for 
V > 1, were computed from the recurrence relation^ 

cosh z 

^ If a is an odd integer, the values of /»(*) can be built up iising (29) from 7-^ (z) 

and 74 ( 2 ;) — — , which of course €u:e tabled. 
y/irz 
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(29) 7^1 - -5:/,(*)+ 

z 


Future calculations of this sort could make use of the forthcoming tables [14] of 
7 ,( 2 } for » <20, and 2 ^ 20 to ten significant places. As before, the tabulated 
imlues were obtained by correcting trial values of <0 by means of the derivative 

(30) g - V2«(2a + 7.(2v^), 


which again was obtained as a by-product of the calculations. This method 
becomes impractical when a is too large, because a great deal of accuracy is lost 
in applying recurrence (29) many times. For some of the larger values of a it 
was found preferable to use the series 


(31) 




-X ^ ^ rtt(a + t) 

M i\ r(o + i) 


although it converges rather slowly. In other words the upper limits for a and b 
were pushed as far as was practicable. 

The values which are tabled to two instead of three decimals were interpolate 
using second differences, all other values were computed in the manner described 
above. Difference tables were made by rows, columns and between tables, as a 
final check on the work. Difference tables using harmonic interpolation were 
also made for both rows and columns, and found very effective, with the exception 
of the lower right hand comer, where tp drops rapidly to zero. The last column 
of each table is to be used for harmonic interpolation. 
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t 

1. The main purpose of the paper is to establish some combinatorial formulas 
concerning the mathematical expectations or probable values of a product of 
n given polynomials. The problem may be stated more definitely as follows: 

Let xi, • • • , Xn be n non-negative discontinuous variables for which we have 
assumed that the probability tbat each x takes a possible value is equally likely, 
and let /i(x), • • • , /n(x) be n given polynomials. Then we shall ask: What 
is the probable value of the product /i(xi) • • • /ft(Xn), provided the sum of the 
variables Xi, • • • , Xn is known? More generally, we m^ty consider the problem 
with certain restrictions to x such as a < Xi < b, (i = 1 • • • n). 

By a limiting process^ it will be foimd that all the formulas established for 
the preceding problem can be extended to the case of continuous variables. 
On this account, it is important to find explicit formulas for the problem merely 
involving discontinuous variables. 

By the definition* of MacMahon, we say that a set of numbers (xi • • ‘ x«) 
is over all different compositions of m into n parts with each x > fc, if (xi • • • Xn) 
runs over all different integer solutions of the linear equation Xi + • • • Xn = m 
with each x > k. We shall use the notation (m; ifc; Xi • • • Xn), or simply (m; 
fc; x), to denote that a set of numbers (xi • * • Xn) is over all different compositions 
of m into n parts with each x > fc. 

The notation E{m;8; [/i(x)] * • • [/n(x)]) will be used to denote the mathematical 
expectation of the product/i(xi) • • • /n(irn) iu which the sum of n variable quanti¬ 
ties xi, • • • , x« is known, namely xi + • • • + Xn = niy and each quantity is a 
multiple of 8 and m is of course a multiple of 8. Thus by the definition* of 
mathematical expectations we have 


(1) E{m; 8; [fi] 


[/-]) = ( E 1) 

(«*/«; i; *) 


E MXlS) 

(m/iTi; x) 


where the summation on the right-hand side runs over all different compositions 
of m/8 into n parts with each x > 1, and the given constant 8 is called a ‘Varying 
unit^\ that is the least possible difference between two unequal quantities in 
(xi • • • Xn)- If the varying unit approaches zero, (xi • • • Xn) will become a set 
of continuous variables. 


1 The limiting process will be illustrated by the proof of corollary 2 of theorem 1 in this 
paper. 

* MacMahon, Combinatory AnalyBis, Vol. 1, p. 160. 

< See for example W. Burnside, Theory of Probability, Chap. 4,13. 
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If /i(*) »•••-> /«(*), we may write 

E{tn', 8; [/]*) instead of E{m; 8; [/i] • • '.[fii]). 

A wdl-known convention for is also adopted here: 

ml 


nf(m — «)!* 
0 


if 0^n£ m, 


otherwise. 


2. Laninas. In order to obtain explicit formulas desired we first establish 
four lemmas as follows: 

Lsmma 1. Lettn,ri, ,rnbe non-negaiive iiUegen. Then 


( 2 ) 



m + » — 1 
• • • + r, + n 



Pboof. (Construct a generating function: 

C 4 -r'(r^r' 


i*i < 1. 


It is observed that the coeflSicient of the term in the expansion of the 

above product is given by 


/ri + a5i\ . , , An + aJn\ as Y) ^^A . , . ( 

\ Xx ) \ Xn / iJtx) Vl/ \rn) ' 

On the other hand we see that the coefficient of the term in the 

Yl+...+r„+n 


es^ansion 

+ 


of(i4- 




is given by 


C‘ 


+ Jn + n + m — (ri + • 
m — (fi + • • • + r„) 


+ r„) - 


l\_/m + n — l\ 
/ ~ ySfi + n - 1/ ‘ 


Hence the lemma. 

TiWMMA 2. Let a,h,c, ‘ • he arbitrary constants, and h ,kt,kt, •••be positive 
integers. Then 


( 8 ) 



«! /3l 7 ! 


Pboof. Expanding the left-hand side of (3), we see that the coefficient of the 
term o • • • is equal to 

itl y* /®i\ / *a\ /a:«+i\ f ^o+A /A .. /*«+«■ A ... 

al/Slyl... (m;o:s) \fci/ \^i/\ ^ / \ kt )\ ki J \ kt / 
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Bylemtna 1 itisreduoedto , 

nl / m + n — 1 \ 

a!j8l7l • • • \aki + 0 kt + ykt + ••• + »»— 1/ 

SubBtituting, we get the lemma. 

From now on, we shall frequently write /**^ instead of /(*), so that 

/ V”++ • • •+(;)/“»“ -(/+»)“. 

Lemma 3. Let m, n (< m) be two positive integers. Then for any given poly* 
nomial f(x) of the kth degree we have 

(4) E m •■■/<*.) - «i E GrAV 

(»:o:p) \o(p) + n — 1/ ,».o p,I 

where 

= /(«), S(j)) = 1-pi + • • • + &-p*. 

Papop. Since fix) is a polynomial of the fcth degree, there exist (k + 1) 
values 01 ,, ,00 such that 

By putting a: — 0 , 1, • • • , fc, we find successively* 

(5) 0, = - (j)/""” + • • • + = Cf - 

(p = 0,1, • • • , A;). 

The lemma is thus obtained by means of (3). 

For convenience, we denote the summation 

Z /i(®i) •••/«(*») by ^(m, [/J W). 

Thus formula (4) can be rewritten: 

r/n - «! V m + n-1 \ 0 J! 0 J!. ..Jjl 

where ft , ft, • • • . /3t are given by (5). 

LemmX 4. Let fiix), • • • , /«(x) be n given polynomiala. Then 

(6) Sim, [/J • • • [/«]) =1 Z (- 1)*~?-S(m. [/„+•••+ /J"), 

^ Strictly speaking, the relation (6) is established and proved by induction on f, in 
which a well Imown combinatorial equality is applied. 
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when the eummation on the right-hand side runs over all different eonibinations 
(Mrf 0/ (1 • • • n), (fc = 1 • • • »). 

For example, if n » 3, then all the different combinations out of (123) will 
be: (1), (2), (3). (12), (13), (23), (123). 

Proof. Consider a typical term 


nl 
31! ••• 


Sitn, (/„]•* 




where 1 < < < «, gi + • • • + gt = n. Now a necessary and sufficient condition 
that the term will be contained in the expansion of Sim, [/m, + • • • + /»•»]") is 
(vi • • • vt)ei)ii ■ • • Mt). i-e- ivi • • • vt) is a combination out of (mi • • • m»). tind for 


a fixed k, there are 



-different combinations of (mi ■ ” m*) satisf}dng the 


condition (vi • • • i'()*(mi • • • M*)«(l ••• n). Therefore the number of occurrences 
of the term in the right-hand side of (6) is given by 


n—f 

Z (-1)' 





(1 ^ !)"-' 


0 if t <n, 
1 if t ^ n. 


The term vanishes generally except gi = • • • = = 1. Hence the right-hand 

side gives S(m, Lfi] • • • [/n]). 


3. Theorems and corollaries. In the following statement of theorems and 
corollaries the notation (xi • • • Xn) will be always used to denote a set of unde¬ 
termined quantities as specified. 

Theorem 1 . Suppose that (xi • • • x») is a set of natural numbers in which only 
the sum of the numbers is knovm^ namely xi + • • • -i’ Xn = m. Then^ for any 
given polynomial fix) of the kth degree, the mathematical expectation of fixi) • • • fixn) 
is given by 


Eim, 1, m 

= nl 


(7) . (m - IV ^ \{if-ifT •••[(/- I)'*']"* 

\n - 1/ \S(p) -h n - 1/ 


Pol 


Pkl 


ikp) \S(p) + 

Proof. Let m' = m -f nr. By lemma 1 we have 

V AA Vi- V A 

(ir.) Voy Voy " (ii) " \ « -1 )' 

This is the number of compositions of m' into n parts with each part > r. In 
particular, if r = 1, we find that the number of compositions of m into » parts is 

~ Thus by (1), the required value is equal to 

{S(m, [ID }-*5(m, [/]•), i.e. " j)’5(m, [/]"). 

The theorem is therefore proved by lemma 3. 
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CoBOiiLAST 1 . Let (xi ■ ‘ • x») be a ut of poeitwe gwnUHee, of vihieh ^ 
varying unit is S, and the eum is m. Then, for given pailiynomM fix) of the 
kBi> degree, m have 


B!ifn,S, [/]•) 


( 8 ) 




L 

\Si3>) + 



i)^"r - • [(g- i)“*r 

3B»1 • • • p*l ’ 


where 


gix) = fixi), Sip) = Pi +-h *Pt. 


Proof. It follows immediately by the relation: 

Eim, d, [/(*)]“) = E , 1, [/(«*)]*). 

Corollary 2. Let (xi ••• **) be a set of noru-negativdy real manbere, of which 
the mm is known, namely Xi + - ■■ + Xn = m, m being a known real number. 
Then, for any given polynomial fix) = Oo + • • • + o*i*, (o* 0) we have 


Eim, 0, [/]") 

(9) ^ (hi)! T ••+*•« (0!flo)«o _ _; (fe!at)“ 

n (n;o;p) (l-ffi + • • • + kgk + n — 1)! (?»! g*! 


Proof. The proof of the corollary depends essentially on the concept 
that two unequal real numbers may differ by an arbitrary ^all number h. 

Let h be an arbitrary positive number and write fixh)/}f — gix, h), where 
the number k is Ihe degree of fix). Then, since 


n 


Ec-ir 

F-.0 



fo 



if p <n 

if p — n 

if p = n + 1, 


we may write 

g(v^ h) — flf(v — 1, ft) + • • • + ^{p\a, + ft-fl^ft)), 


where 

lim R,ih) =!(;) - (;) iv -+ 


K = 0, 1, 
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Now we pass to the limit h-* 0 m which we assume that h nms throu^ a sequence 
of real niunbers of the form ^, N being a natural number. Th\is by coroUmry 1 
we have 


lim E(m, h, [/]") 



^ ttl(n — (ytar)*" 

(n: 0 ip) (Sip) + n — 1)! 1-0 p,\ ’ 


(5(p) = l>pi + ' • • + 


Hence the corollary 

CoBOLiiABT 3. Jbei (®i • • • *0 be o «e^ of positive real numbers under a known 
emdition o < ®i + • • • + x» < b, where a (< b), b are two positive real numbers. 
Then, for any given polynomial f(x) = oo + • • • + o*a:*, (a* 9 ^ 0), the malhematical 
expectation of the product /(xi) • • • /(x«) which we denote by Eiiab), 0, [/]") is given 
by the formtda 


(W) Eiiab), 0, m = 


nl(n — 1)1 
b — a 




(nV») (1 + S(q)).{n - ,1 + S(g))I 


-ffo /T.I . \Qk 

gil • • • g»! ’ 


(S(g) 


gi + ••• + kqk). 


Proof. Since the probability that the sum of xi, • • • , x» takes a value 
between a and b is equally likely, we see that the required mathematical expecta¬ 
tion will be the mean of [ E(u, 0, that is 

Ja 

E(m, 0, ifT) * jf Eiu, 0, [/]») d«. 


* Thia corollary can also be proved by means of Dirichlet’s integral. In fact, the right- 
hand side of (9) is given by the quotient of the two integrals: 

{f dxi ... da:0/(/ ... fd^i .. dx^s), 

the integrals being taken over the region: + ... + Xn ■■ m, oji ^ 0, ... , ^ 0. 
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The fonxulla (10) is therefore obtained by integrating the rightrhand side of (9) 
and dividing it by (b — o). 

On the oi^er hand we find that 

lim a + h),0, [/]") » E(a, 0, [/]"). 

A-*0 


This shows that the corollary 2 can be also deduced from 3. 

Theorem 2. (A generalization of theorem 1). Letfi{x), • • • , /,(*) be n given 
polynomials whose degrees do not exceed k. Then we have 

E(m, 1,[/J ••• [/,]) 

/ m + n- 1 \ 

^ (-i)n- A^(p) + » — 1/~ 

(n;0;]>) /Wl — l\ /—O P/I 

U-w 

where 

fpi-pt = ‘ ^(p) = 1 ’Pi + • • • + k‘Pi. 

Proof. In the proof of theorem 1 we have shown that 
E(m, 1, [/]") “ I iyS(m, [/]"). 


Hence, by similar reasoning and lemma 4 we obtain 

E{m, 1, [ft] [/»]) = (^1 1) [/.]••• [/»]) 

- 2 -- —rr E(m, [/„ + • • • + f,,T) 

.. 

( wt + n — 1 \ 

Sip) + n - 1/ 

C-'O 


(n- 


Z Z (-1)" 

(Fj•••»,)<(!• on) (n;0;p) 


n i (/>..- 

M P/I 


Theorem 2 is proved. 

Corollary 1. Let She a varying unit. Then 


•••[/.]) - Z Z M 

..Vs(p) + „_l/ 

/m- iVfrKg, .. ~l)^r 

\n-l/ P/I 


(12) 
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tofcere 

*/»(*«),(»' = 1 ••• n) and ff,,(a:)+•••+ g,,(»). 

Proof. This is almost trivial because of 

S, [/x(a;)]---t/n(x)]) = 1, [/i(x«)] •••[/-(*«)]) . 

Corollary 2.® For any given positive real number m, we have 

(13) E{m, 0, [a:'*] • • • [x'"]) = 1)! 

(Pi + • • • + Pn + n — 1) I 
Proof. By a passage to the limit 5 —»0, we get 

lim E(m, S, [/,] • • • [/.]) = lim E «,[/„+•••+ /„]’*) 

- E lim Eim, d, 

n ! 5-^0 


i.e. 

E(m, 0, [fi\ • • • [/„]) = E E(m, 0, 


The corollary is then deduced by (9). 

Theorem 3. (Further generalization of theorem 1). Let (xi • • • Xn) be a set 
of arbitrary integers restricted to the conditions: 

Xi+ ••• + Xn ^ m, a < Xi<b, 


where m, a, b are all known integers. Then for any given polynomial f{x), the 

mathematical expectation of the product /(xi) • • • /(xn) denoted by E (m, 1, [/]”) 

( 0 , 6 ) 

is given by the following 


(14) 


E (m, 1, [fD 
( 0 . 6 ) 


Z/-i)'(”)s«iprw"-') 



9 


vhere 

g{x) = /(a: + h), h(,x) = f{x + a — 1), m' = m — (o — l)n + (a — 6 — I)!*. 


Proof. Define 


S(rn, [/]") = 0 for m < n and S(m, [/I®) 


,0 for m > 0, 
ll for m = 0. 


> This can also be deduced by Dirichlet’s integrals. 
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We shall show that 

ti-iy(fj8{m',\gY[hr-^= £ 

r-O V/ (»l-***is) 

where on the right-hand side of the expression the set (xi • * • Xn) under the 
summation runs over all different compositions of m into n parts with each 
o < < b, (i = 1 • • • w). 

We denote the left-hand side of the expression by ©, then by decomposing 
S(m',tor[/i]"*“Owehave 

® = i: £ (-1)' (”) -S(^, [fix + b)DSim' - M, [fix + a - 1)]""'). 

pm^O ni^p \V / 

Let/(xi) • * * f{xn) be a term contained in ©, i.e. xi + * • * + Xn = m, > a, • • • , 
Xn > a. And we suppose that > 6 + 1, • • • , ^,4 >6+1, where 5-^ vj 
if i 7 ^ j. Then it is found that the number of occurrences of the term in @ is 
given by 

£(-ir 

s-O 


C) - (■ - *>' 


fO if < > 1, 
1 if f = 0. 


This shows that the term f(Xi) • • • f(xn) of @ generally vanishes except a < Xi 
< 6. Hence we have 

@=2 m---f{x„). 

a^x:&b 


Next, we shall find the number of compositions of m into n parts with each 
a < Xi < bj i,e. the number of terms in ©. By the result just obtained we see 
that the number is given by 


_ 1 D 1 

,l\x) (m'— 


IN.0 mO y*' / 


m — 

V — 


= 2 (- 1 )' 
v-0 




m — 
n 



Hence the theorem. 

The theorem just proved shows that the mathematical expectation 
E (rn, 1, IfT) can be expressed by S(w, \gT) and is therefore expressible^ in 

(a,h) 

terms of the linear combinations of the coefficients of the polynomial fix). 

Corollary 1. Let d be a varying unit for which f \ are aU integers. Thm 

0 0 Q 


E (m, 8, [/(x)]") = 

(a.I>) 


E (7,1, [/(to)]" 

(It) 


^ See lemmas 3 and 4. 
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CoBOLLABT 2. Lei fi(x), • • • , fn(x) be n given polynomials. Then 
E (m, 8, [/d . •*. [/J) - r E (m, «,[/„+.••+ 

<0.6) (l***n) (o,6) 

Corollary 3. The number of compositions of m into n parts with each a< < x» 
< 6 <, (t « 1 • • • n) equal to 

/m + n-- (ai + •••+ an) + (ai — 6i — l)vi + • • • + (On — 6n ““ l)>'n l\ 


( m + (ai + • • • + an) + (ai — 6i — 

n - 1 


Proof. We have shown that the number of compositions of m into n parts 
with each a < x < 6 is given by 


Zc-i) 


, /n\/m ~ (a — l)n + (a — 6 — l)i^ 
VJV n -1 


Hence the number of integer solutions of the equation 

Xn + • • • + kini + • * * + + • • * + = m 

with each a, < < bp(p = 1 * • • s; ^ « 1 • • • n,.) is given by 


E E ••• E 

••*«!«) ri-O r,-0 


H(:) 


( roi — (oi — l)n< + (o< — bi — l)i'< — l\ 
nt- I ) 


«1 n, * /^\ 

■= E - E 

r|MiO »,-.0 {aal \*'«/ 

f ^ jQ /m< — (a< — l)n< + (a,- — 6< — l)i/< — iM 

\(m;l;m<) {.-I \ — 1 /J 

“ *e' (_i)'t+- 

Vi/ Vv./ 

/m - 1 - X) (oi - l)«i + E («i - *>i - 

\ ni + • • • + n, — 1 / 

The corollary follows at once by putting rii = • • • = n, = 1, s = n. 

The last corollary may be restated as follows: Let there be n stores, 
bif g bn being the numbers of stocks contained in 1st, 2nd, • • • , nth store 
respectively. Then m stocks containing at least ai stocks of the tth store can be 
taken from these stores in 

'*2^ (_2)ri+—+i'» /w + n—1 — 1)^A 

r|attO,***.ri^ \ n “• 1 . / 


— dififerent ways. 
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We have now established several c(»nbinatotial formulas omuserning Uie 
mathematical expectations of the product /i(xi) * * ■ f»ixn) under certain emkdi- 
tions. Apparently, there are many examples which can be solved by means of 
the results just obtained. For brevity, we may state a general criterion as 
follows: The mathematical expectation of a function, F(xi , ■ , Xa) say, can 

be estimated by the above mentioned formulas, if and only if 1) the sum of 
* 1 , • • • , Xn is known, and 2) there exigt n polynomials /i(x), • • • , /»(x) such 
that F(xi, • • • x„) is proportional to /i(x.), (t = 1 • • • n). The undetermined 
quantities in (xi, • • • , x») may or may not be continuous, if the quantities are 
discontinuous, the var 3 rmg unit is necessarily known. 


4. Convenient formulas for differences of zero.^ 

Given /(x) = |8o + jSi® + • • • + i8tx*(/3i 0) we may write 

(/- 1)‘'’ = t = i:/S.A'0*. 

^ «-0 «»>0 

where S,,t is a Stirling number of the second kind, as used by Jordan and defined 
by 

.15,., = A’O* = E (-1)'-* Q X*, 


A'O* being in the language of the calculus of finite differences, “a difference of 
zero^\ 

In terms of the differences of zero, the formulas (7) and (11) may also be re¬ 
stated as follows: 


(7)' 


E(m, 1, m 


z 


(n; 0 ; p) 


(m + n — l)l(m — w)ln!(n — 1)! 

(m — 5(p))!(/S(p) + n — l)l(m — 1)1 

X n —, OrA’O’ + • • • + /StA'O*)”’. 

»-.0 Pyl 


F(m,l,[/J ••• f/.]) = E* (-1)"-' 

< (!••• n) 


( 11 )' 


where 


^ (tTI -f- 7i — 1)1(771 7l)!(7l — 1)1 

(n^p) (m - S(pms(p) + n - l)l(m - 1)1 

X n + • • • + 5*A'0*)"’, 

Pyl 


f.{x)=^fia+ -t-/3,*x*, = ••• +/5-C. S(p) = l-Pi + + kpt. 


* The methods for obtaining convenient formulas for differences of zero as stated in the 
first part of this paragraph are similar to those used by Paul S. Dwyer in his paper ^The 
computation of moments with cumulative totals,'' Annals of Math, Stat,, Vol. 9 (1938), 
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Hie formulas (7)' and (11)' tell us that the difference of zero pla3rs an important 
rOle in the calculation of mathematical expectations of a polsmomial product 
under known conditions. On account of this fact, we are now going to investigate 
some recurrence relations and approximations for the differences of zero. 

As m is larger than t, we may find a convenient recurrence relation as follows: 


m! 






+ 


(16) 


— I Om,m+<+l 


= Xi(i + 





+ Xj(t + 





where 


(< + i)x,-.i(0 +i*X/(0 = + 1), Xo == Xf+i(0 = 0, Xi(0 = Xi(i + i) s i, 


and XjCO, * • • , X<(0 are all independent of m but depending on t. 

Starting with the first equation of (15) and using a well known relation (due 
to Jordan) Sm.n+i = + m-*Sm.n, successivcly, we get 




^ V t + j 


- g {« + i)x^i(0 + Mm (^+ j + 


m + ( + 1 
t + j+1 


(’ 
0- 




The recurrence relation thus follows. 

By successive applications of the relation Xy(t +!) = (< + j) •Xf_i(f) + i'X/(0i 
after nth time say, we may express X,(<) as a linear combination of X,(< — n), • • • , 
X,-n(< — n), but the coefficients are too complicated. 
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For i < 9, by applying the tecurrence relation as obtained abo^, the coeffi* 
dents may be exhibited as follows; 


( 

X,«) 

x.a) 

X*(0 

Xi(0 

1 





2 

3 




3 

10 

15 



4 

25 

105 

105 


6 

56 

490 

1260 

945 

6 

119 

1918 

9450 

17325 

7 

246 

6825 

66980 

190575 

8 

501 

22936 

302995 

1636635 

9 

1012 

74316 

1487200 

12122110 


x,«) 

XjO) 

X,0) 

X«(0 

10395 

270270 

136135 



4099095 

4729725 

2027025 


47507460 

94594500 

91891800 

34459425 


For example, when < = 4 we have, aQcording to the table: 

4-0-« - {(”■ ?■ V “ ^ V (”■ ^ V (” 8 ')}“'■ 

We shall now proceed to find some approximations for Sn,n+t and 
Firstly, we may write 

^ ~W- ^ (”+ i) + • • • + Ml) (” J ') • 

According to the recurrence relation we have 

(i) Ml) = (2< - l)X,_i(t - 1), 

(ii) X,_i(0 = 2(« - 1)-X,_s(t - 1) + « - 1)-X,_i(t - 1), 

(iii) X,_s(0 = (2t - 3)-X,-^(< -!) + («- 2).X,_,(t - 1). 

Hence 

Ml) = x^,(t) = (i - l)I2‘-V(0; 

\Ut) = 2‘-* £ (j _ 2 - j)!(« - 2 - i)-(t - 1.5)rait - 1 - i), 

l-o 

where 

<rik) = 2 , (t - 1.5)y = « - 1.5)(« - 2.5) -‘-(t-j- 0.6). 

Evidently, the orders of ^ l) ’ ^ ‘’ (”+” l) ^ 

n tends to infinity. 
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Novr, it can be easily found that 




< _ it - l)<(2t - 1) (1 

^ n 6»» ^ ”\n' 


, + 2‘-fI ^n + t\ 

-l)-n-t+l\2t - (n - < + 1).(2<)I 

1 /n*\‘ f2»‘«r(0 . (2« - l).2*‘«r«) . „/ 






Hence, we may write 


-Sn..+* = ^(:^)^i + ^‘ + ^^ + o 


'©)• 


where 


= < + 2* 




4*-tUlait) 


„ = -m - 1)(21 - 1 ) + ^(0 + ^^1 wo . 

Moreover, it can be shown by Wallis^ formula that 


i/'hr (a;- 1,2,3, •••). 


Thus we have 


'<« ■ S^(?) < I /; < { /> - 

'W -1 r- (?) > S r = 3-^ 


Again, by Wallis' formula we have 
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Combining these inequalities we<get 

Wi (1 - 2 )' < i' (*)- .(() < (f - 1 ). 

where 


IVT it - 2)» - ft*, («* “ 1) ~ ?**• 

Therefore, 

t + fVTV it - 2)* < Pi < t + fVr+1 (V? - 1), pi ~ ft*. 
Next, by Stirling’s formula 


nl = 0)V2™{l+i+j^ + o(i)} 


we obtain 


A» o"+‘ * ( A* /i+?}+^+ofiy~ f i+^, 

\€/ V 2 / tl \ \W/ e*-2^<! \ ^ 3n/ 


where 


== Pi + f t*, 

6* = P2 + "Api + 
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RANGES AND MIDRANGES' 

By E. J. Gumbel 

New School for Social Research, New York City 

1. Introduction. In the following the generating functions of the extremes 
are studied in order to determine the nature of the distributions of the ranges and 
the midranges. 

A large sample of size n is considered to be drawn from an unlimited symmetri¬ 
cal continuous distribution with zero mean. The difference and the sum of the 
largest and of the smallest observation, the extremes, are called range and 
midrange. R. A. Fisher and L. H. C. Tippett [2] have established the limiting 
distributions of the largest and of the smallest member of a sample. The exact 
conditions under which these distributions hold have been given by R. von 
Mises [4]. For a normal distribution, L. H. C. Tippett [7] has calculated the 
numerical values of the mean range and the first four moments of the range for 
sample sizes varying from 2 to 1000. He has shown that, for sample sizes 
exceeding 200, the correlation between the largest and the smallest observation 
may be neglected. Later, E. S. Pearson [5] has calculated the probability 
function of the range for small samples (n = 2 to 20) taken from a normal 
population. These calculations are very laborious. Recently, W. E. Deming 
[1] has applied the range to quality control. 

The concepts ^‘extremes”, ‘^range^^ and ‘^midrange’^ allow a simple generaliza¬ 
tion. Let fnX and Xm be the mth observation in increasing and in decreasing 
magnitude, hencefort|]^ called mth value ^ffrom below^' and ‘ffrom above^\ As 
long as the index m is small compared to the sample size n, the mth values under 
consideration are extremes. The difference and the sum of the mth extreme 
observations are called the mth range and the mth midrange. We will investigate 
the asymptotic distributions of the mth extremes, of the mth range, and of the 
mth midrange. Assuming that the number of observations is very large, the 
correlation between the largest and the smallest observation may be neglected. 
Then the mth range and the mth midrange are the difference and the sum of two 
independent variates, the mth extremes. 

It was found that the distribution of the mth range is skew and the distribution 
of the mth midrange is of the generalized logistic type, which is symmetrical. 
For m increasing the distributions of the mth extremes, the mth ranges, and 
the mth midranges converge toward normality. 

2. Generating functions of the mth extremes. Let (p{x) be an initial con¬ 
tinuous S3mmetrical distribution with mean zero; let be the most probable 
mth value from above; let be defined by 

( 1 ) ^»»(«.). 

m 

^ Research done with the support of a grant from the American Philosophical Society. 
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Under conditions given in an earlier paper {3] the distributions/«(»») and */(*,*) 
of the mth extremes are, for large n and gmull m 

(2) /»(*») «/.(-*,) 
where 

(^) 2/m ~ ^m(^m ““ Wm) 

is a reduced extreme mth value. 

The moment generating function Gm(0 of the mth extreme value from above 
obtained from the preceding equations is 

Introducing 


the integral becomes 


r 






m 


-m-f (f/ttin) 


whence 

(4) 


U„«) = 


T ^ ! V{m). 


To obtain the moments of the mth extreme value from above, the fundamental 
property of the Gamma function is used; whence 


t 

m — j' — — 


0-i)- 


r-i m — V 

Finally, by reversing the order of multiplication the moment generating function 
of the mth extreme value from above becomes 

(5) G„(<) = r (l - —). 

r-»l \ VCtm/ \ C^m/ 

The mean Xm of the mth value from above obtained from (6) is 

1 i 1 \ 

(6) = -{ifiTO-23- + c) 

a» \ »-i V / 

where c = .57722 is Euler^s constant. 

The seminvariant generating function LJS) of the mth extreme value from 
above becomes from (6) and (6) 
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and after e]q)ansioB 

(7) i««) = 1:5 -S'- 

where the sums 

(8) 

JbM>m 

are obtained from the sums 


(80 


S. 



v^2 


which are known from the theory of the Gamma function. The numerical values 
of the seminvariants of the mth extreme value from above Xr.m, being the coeffi¬ 
cients of i'/vl in the expansion (7) may be calculated from a table of the sums 
&, m given in an earlier paper [3]. 

From the generating functions (4) and (7) of the mth extreme value from above 
the moment generating function mG(t) and the seminvariant generating function 
mL(0 of the mth extreme value from below are obtained by the symmetric 
relation (2) as 

(9) n.G{t) = Gn.(-t); rnLit) = 

and the mean of the mth extreme value from below is 


(6^) • 

The seminvariants and mX, of the mth extreme value from above and from 
below are linked by 

(10) c4Xm., = (v - - (~l)^a;«X,. 

The standard errors <rm and of the mth extreme values are 

( 11 ) OtmO’m ~ m ~ 

This procedure for obtaining the moments of the mth extremes from their 
distribution (2) parallels closely that used by R. A. Fisher and L. C. H. Tippett 
[2] for obtaining the moments of the largest and smallest value. The special 
case m = 1 of the formulae (4), (9), (6), (6'), (11), and (10) leads to the moment 
generating functions, the means, the standard errors, and the higher semin¬ 
variants of the larjgest and of the smallest value given by these authors. 

The two parameters Um and «« which exist in the distribution (2) of the mth 
extremes may be calculated from the observations by virtue of equations (6) and 
(11). Thus, the theoretical distributions (2) may be compared to observations 
even if the initial distribution <p{x) is unknown. For increasing m, the distribu¬ 
tions of the mth extremes were shown [3] to converge toward normal distribu¬ 
tions, their means and standard deviations being given by (6), (6') and 11. 
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3. GMMnitii]|[ ftmdiotu <tf tbe mtii nngs and the mih midieitBe; To ofaftidQ 
the oharoctwistica for the mth range and the mth mid-range we Btate first Jome 
general properties of the sum and of the difference of two independrat variates 
X and y, with means X and f and standard deviations <r, and vy . Let the distad-' 
butions be ^(x) and ^(y), and let the generating functions of the seminvariaatB 
Xa,r and Xy,,be Ls(0 and 2^(0. We write 

(12) p IB X + y; w=>x — y 

from tire sum and the difference of the two variates. Then the means if and O 
and the variances <r* and vi are 

(13) if = 2 -1- y; ® = 2 - y; vl <rl + vl ^ vi' 
and the seminvariant generating functions L,(0 and L^{t) are 

(14) L.(<) » L,«) + 1^(0; LUi)L.(t) + Ly(-t). 

The n^ative sign in the second equation (14) is obtained through the same well- 
known derivation as used for the first equation (14). Therefore, the even semin- 
variants of the sum are equal to the even seminvariants of the difference, whereas 
the odd seminvariants of the sum and of the difference are 

X»,srfi = + Xy,s,+i; = X«,jrfi + (—l)*'^*X»>+.r. 

If the distributions ^(x) and ^(y) are symmetrical one to another in the sense 

(15) ^(y) = v(-x) 

then the even seminvariants of the two variates x and y coincide in size and s^ 
and the odd seminvariants coincide in size and differ in sign. Under the condi¬ 
tion (15), the even seminvariants of the sum v and the even seminvariants of the 
difference to are twice the even seminvariants of the variates x or y. The odd 
seminvariants of the sum v are twice the odd seminvariants of the variate x, 
and .the odd seminvariants of the difference to vanish. 

We apply these properties to the mth extreme values and write x* for x and mX 
for y. According to (2) the distribution of the mth extreme from above is sym¬ 
metrical to the distribution of the mth extreme from below in the sense (15). 
The mth range io« and the mth mid-range o«, are defined by 

(12^) tOm " Xm mXf Vm — Xm “f" mX» 

The mean ©„ of the mth range, the mean o* of the mth mid-range and the 
respective variances and are, from (6), (6') and (11) 

2Stm 

(130 = 22« ; t>» = 0; vj, = 2«ri = 2,»<r* = = 4,. 

CXm 

The seminvariant generating ftmctions Lv(i, m) of the mth range and Lv(i, m) of 
the mth mid-range obtained from (7) and (9) are 

(140 L„(<, m) = 2 £ — S,.B - 2L„ii); L,(t, m) = 2 £ . 
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The seminvariants of the mth range are twice the seminvariants of the mth 
extreme values from above. The even seminvariants of the mth mid-range 
the even seminvariants of the mth range are twice the even seminvariants of 
the mth extremes. The odd seminvariants of the mth mid-range vanish. 'There¬ 
fore, the distribution of the mth range is skew, and the distribution of the mth 
midrange is sjmimetrical. From the convergence of the mth extremes towards 
normality it follows that the mth range and the mth mid-range tend also, for 
increasing indices m, toward normality. 

The seminvariants of the range and the mid-range are obtained from (13') 
and (14') by putting m = 1 and omitting the index 1, Therefore, the standard 
errors and of the range and of the mid-range are 


(13") 



\/2 (Tn = Otffv 


where a is given by (1) and crn stands for the standard error of the first or the 
last value. The skewness Pi,y, of the range and the excess jS 2 — 3 for the range and 
the mid-range 


(16) fiiu, = .64928; /Sa.to - 3 = 1.2 = ft. ~ 3 


are only one half of the corresponding characteristics of the largest value. The 

distribution of the range is less skew and less concentrated toward the middle 

than the distribution of the largest value. The moments (13) and (16) of the 

range and of the midrange may also be obtained directly from Fisher’s and 

Tippett’s results [2] when independence of the two extremes is assumed. The 

numerical values (16) for the limiting distribution of the range differ considerably 

from the values 
% 

(160 = .309; ft.. - 3 = .54 


given by Tippett [7] for n = 1000 observations. For increasing n the approach 
of the distribution of the range toward its limiting distribution is very slow. 
This is reasonable as the approach of the distribution of the largest value toward 
the limiting distribution is also very slow. 

Until now we considered symmetrical initial distributions. The case of an 
asymmetrical initial distribution may be dealt with briefly. The most probable 
mth extreme value from below mU differs then from — Un and 


(10 


mOt 


m 


differs from «« . The distribution of the mth extreme value from below is for 
large n and small m [3] 


(m — 1)1 


m„iV—me»»»v 
6 


(jy) 


ffi/(m3j) — mO£ 
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.wliere ■ "= 

(30 mV ** mOt(mX — «w) 

is a reduced mth extreme value from below. The moment goierating functicm 
jS(t) of the mth extreme value from below becomes 


(40 


0(t) - e-“‘m-“'-“>r (« + -^)/r(m) 


and the moment generating functions G»(t, m) and 0,{t, m) the mth range and 
of the mth mid-range of an asymmetrical distribution obtained from (4), (40 
and in analogy to (14) are 


(17) 


(?«(<,m) = (n- /r*(m) 

(?,«, m) = ^m- + r*(m). 


Thus, the moments of the mth range and the mth mid-range may easily be com¬ 
puted even for an asymmetrical distribution. 


4. The distribution of the mtii midrange. In the following we return to a 
symmetrical distribution and establish directly the distribution f{vm) of the mth 
midrange. Then, the generating function (14') and the convergence toward 
normality will be verified. 

From (12') the distribution of the mth midrange is 

f(Vm) “ dflJm . 

Introducing (2), the equation is written 

Using as before 

e”"" = z 


the integral becomes 


e 


f 




e"“"’-(2m - 1)1 
m*“(l -H e-"-'")*"’ 


The distribution of the mth midrange is therefore 
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Hie distribution (18) will be tiiown to lead to the seminvariant gttiexatmg 
function (14')- Hie generating function of the mth midrange obtained from 

(18) is 

^ r* I dOmV^ . 

(w-l)l*i« U-re ; U«,». 

Introducing 

1 + e““*" = u~‘; du =* —m*c““*** da»,Vm 
the integral is rewritten 

r (1 - 

(m - 1)P Jt ^ ^ 




r(2m) 

r*(7») 


r(» + l)r(«-l) 


r*(7») r(2OT) 

Consequently, the moment generating function of the mth midrange is 


r(m + —^rfm- 

=a \ Q^/ \ «m/ 


(19) (?.(«, m) = . 

This expression is identical with the product of the moment generating functions 
(4) and (9) of the extreme mth values. Therefore, equation (18) is the distribu¬ 
tion, and equation (14') is the seminvariant generating function of the mth 
midrange. 

For m = 1 the distribution (18) of the mid-range becomes the so-called logistic 
distribution which is coihmonly written 




(1 + e““’)^ 


Accordingly, (18) may be called the generalized logistic distribution. The proba¬ 
bility F(v) of a value equal to, or less than, v obtained from (18') is 

(20) Fiv) = 1/(1 + e-n- 

Therefore the distribution/(v) may be expressed by the probability F{v) through 


fiv) = aF(t;)(l - Fiv)). 


Formula (20) considered as a growth function plays a r61e in population 
statistics where it was introduced about 100 years ago by Verhulst [8]. Recently, 
it has been widely used by R. Pearl [6]. In his treatment, the value v stands 
for the time, and the function Fiv) stands for the relative size of the population 
at time v compared to its alleged asymptotic size. 
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In the foUowing, the influence of m On the <flstributi(m (18) is etudied.' The 
distribution oi the reduced mth mid-range 

( 21 ) * Omy/rn V 

is, from (18) 


( 22 ) 


0m(*) 


1 (2m — 1)! e*v^ 

1)P (1 + e./VS)>»- 


The probability density of the mean mth reduced mid-range increases with m. 
Indeed, 


^i*+i(0) ^ 2m "t* 1 1 

^(0) 2V'm(m -i* 1) 

Therefore, the standard error of the reduced mth mid-range decreases with 
increasing index m. This is reasonable as the mth mid-range is an estimate of 
the median for the initial distribution. The larger m, the nearer are the mth 
extreme values to the median, and the better is the estimate. 

To verify that the distribution (18) of the mth mid-range tends toward nor¬ 
mality for large indices m equation (22) is rewritten 

Ig - Ig 0„(O) + zy/m - 2m Ig . 

Expansion of the exponential and the logarithm leads, if we neglect the third 
and higher powers of the deviation z, to 

Ig ««(z) = Ig 0«(O)-H - 2m ^ 

whence by virtue of (11) 

f{v) «= Const e"“ 

The distribution of the mth mid-range becomes normal for large indices m. 
The mean is zero, and the standard deviation is 

(23) 

This is in accordance with the statement (13'), as &,m tends with increasing m 
toward 1/m. 



6. Sumxnary. For initial symmetrical distributions the seminvariant gen¬ 
erating functions (14') of the mth range and the mth mid-range are obtained 
from the seminvariant generating functions (7) and (9) of the mth extreme 
values from above and from below. As the two mth extreme values are sup¬ 
posed to be independent our results hold only for very large, sample sizes. The 
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even eeminvariants of the mth mid-range and of the mth range coincide. The 
distribution of the mth range is skew and the distribution of the mth mid-range 
is the generalized symmetrical logistic distribution. For increasing indices m 
the distributions of the mth extremes, the mth ranges, and the mth mid-ranges 
converge toward normality. 
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NOTES 


This section is devoted to brief research and expository articles on methodology 
and other short items. 


NOTE ON ASYMPTOTIC VALUE OF PROBABILITY DISTRIBUTION OF 
SUM OF RANDOM VARIABLES WHICH ARE GREATER THAN A 
SET OF ARBITRARILY CHOSEN NUMBERS 


By Bkadford F. Kimball 

New York Public Service Commission, New York City 


The purpose of this note is to present the following theorem which the author 
needed in connection with a computational problem. Since the theorem has 
general implications for statistical theory which do not seem to have been 
brought out heretofore, readers of this journal may find it of interest. 

Theorem: In Euclidean space of n dimensions with coordinates Xi {i = 
1, 2, • • * n) let Ox be n constants whose sum is uq, and consider a hyperplane 

(1) X\ X% Xn ^ u, . U > lit). 


Let k denote a positive constant, and I(u) the n 1 fold integral defined by 

(2) I{u) f * ‘ * J d,X2 • • • dXn^i 


taken over that part of the hyperplane for which Xi > at . Then 
(3) lim 

<U -»00 

Proof: The integral may be reduced to another integral by reduction of the 
quadratic form 

n n-X / n-1 \2 

t-1 t-1 \ / 

to a sum of squares y\ such that yi does not involve Xj lor j < i. Dropping the 
superscript n — 1 in the notation for X and letting the subscript on X denote 
the least value of i involved in the sum, the expansion of this form may be written 

2^^ Xi — 2u^ Xi + 2^ Xi Xj + u^, i < j. 


The transformation may be performed progressively as follows: 

iV2 xif + 2 (xj + aiJs + • • • -m) a:i + [(m - 2] Xi)/\/2]* = y\ 

2 

giving 

yi = \/2 xi — {u — Yj Xi)l-\/2. 

2 
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Using the remainder of the terms involving xtf one can complete the square 
with terms not involving and the value of is 

y/8fV2 - (w - xdlVQ. 

s 

Continuing until only terms involving Xf^i and u remain, 

yn-i * Xn^i y/n/y/n — 1 — u/\/n(n — 1). 

The renm-ining term in u will be found to be u^/n. 

Making the above transformation, the integral becomes 

(4) I{u) *= / • * ■ / dyidy^^i. 

In order to fix the limits of integration on it will be noted that the projection 
of the critical region of the hyperplane (1) upon the n — 1 dimensional space in 
the original variables Xi delineates a region in that n — 1 space bounded by the 
w — 1 hyperplanes 

Xi - Qiy i = 1, 2 • • • , n — 1 

and the h 5 rperplane 

a:i + a:2 + Xs + • • • + iCn-i ^ u — . 

Hence, if (2) is considered as an iterated integral with the integration per¬ 
formed in the order of the subscripts of Xi , the intervals of integration are 


ai 

< Xl 

< w — a„ — 2 oJi 

2 

Os 

< X 2 

<W — 0 » — Cl — 

Or 

< Xr 

< M — - (uo — 2 0 <) 



r+l r+l 

On —1 

VI 

< M — Wo + On-l , 


where it is recaUed that 2^ denotes 23 

r+l t—r+l 

Now transforming to y, and using the general transformation equation 
(6) 2/r = Xry/r + \/y/r— {u — ^Xi)/y/{r + X)Ty 1 < r < n - 1 

r+l 

where 2 j = 0 under definition of summation symbol noted above, 

n 

Lower limit of 2/r = - (w — 23 xii)/y/{r + l)r + Ory/r + 1/V7» 

r+l 

Upper limit of j/r = (w - Z) y/Tly/r + 1 ~ (wo - 23 «») y/r + l/Vr i 

r+l r+l 

since 

y/r ^ \/y/r — l/y/ (r + l)r = y/r/y/r + 1. 
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It is not difficult to show tiiat 

]C *< = C,+iiyt) + (n — 1 — r)u/n 

f+1 

where Cr+i(i/i) denotes a linear combination of yi for t > r + which does not 
involve the variable tt. In other words 

M — 22 *< = (r + l)u/n — C,+i( 3 /i). 

f+1 

Making this substitution, the limits on Pr are found to be 
Lower limit of y, * — (u/n) (\/r + l/y/r) » 

+ C'r+i(y<)/\/ (t + !)»• + r + l/\/r» 

Upper limit of y, = (u/n)(-\/r(r + 1) - Cr+i(yt)y/r/\/r + 1 

-(«#-• 2 o<)Vr + 1/Vr. 

r+l 

It will now be clear that as u becomes infinite, the limit of the integral in ( 4 ), 
will be the ra — 1 fold integral taken over the whole of n — 1 space. This latter 
integral is easily evaluated to give and the theorem follows. 

The following two corollaries, which are restatements of the theorem in less 
general form, bring out the implications for statistical theory. 

Corollary 1. With k = 1/2 define F„(u) by 

(6) FM = (2»)-“'*7(m). 

The differeniial F„(u)du represents then the probability that n random variables X{ 
taken from a normally distributed population with zero mean and unit standard 
deviation, fcdl into a region 

(7) Oi < Xi 

and have a sum with value in the neighborhood (±i du) of u. 

Recalling that uo is the value of u when each has value a,- 

|»00 

(8) P[o< < a;,] = / F«{w) du 

is the probability that all values of x fall into the region (7). Denoting the normal 
probability function by <l>{t), corollary 1 implies that 

(9) lim y/n F„(u)/<l>(.u/y/n) = 1. 

Since (l>{u/\/n)du/\/n represents the probability distribution of the sum of 
the n random variables when the condition (7) is removed, certain implications 
for the theory of statistics emerge. One of these is noted in the example given 
below. Corollary 1 can be stated in a different form as follows: 

Corollary 2, If pa {u)du denotes the prohability that the sum of n random varia- 
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blesfrom a normally distnTnUed population be in the neighborhood (=fc§ du) of u, and 
Pb(u) denotes the probability that the sum of n random variables from the same 
population that do fall in region (7) have a value in this neighborhood of w, it follows 
from (9) and the nature of the functions integrated that for arbitrarily small positive 
8, a value of u, say u' can be founds sufficiently large^ such that for allu> u\ 

(10) (1 - 8) Pa{u) < pa{u) P[ai < x,] < Pa{u), 

Rate of convergence. The author having had occasion to compute Fn{u) for 
values of n from 2 to 5, and a,* = 0, a table showing the rate of convergence of 
Fn{u) to its limit for this range of n (and a,- = 0) is shown below. In this table 
the values of the ratios of the minimum values of u (= u') to the standard devi¬ 
ation of w (= V^) are shown for a sequence of values of 8 which approaches 
zero. 

Rate of Convergence of \/n F«(w)/0(w\/n) to Unity for ai = 0 


Critical Ratio u^y/n 


s 

n - 2 

n = 3 

n « 4 

n - 5 

.6 

0.67 

1.34 

1.92 

2.46 

.25 

1.15 

1.95 

2.65 

3.27 

.1 

1.64 

2.60 

3.40 

4.11 

.05 

1.96 

3.02 

3.89 

4.65 

.01 

2.58 

3.85 

4.87 

5.76 

.001 

3.29 

4.84 

6.03 

7.08 


Example, An example showing the possible bearing of this theorem upon 
practical considerations of sampling is the following: If in a quality control 
problem, samples of size 4 were used, and a follow-up of samples which showed a 
large deviation of sample mean were pursued, a case of a particularly large devi¬ 
ation such as 4 would possibly receive special attention. With sample mean 
equal to 'a/4, the ratio of this mean to its standard deviation is u/2. Turning 
to the table, in column n = 4 it will be noted that the value of 8 for uy2 = 4 is 
somewhat less than .05. Hence from (9) and (10), for u > u' (u' = S, n = 4), 

,95 <l){u/y/n)/\/n < Fn{u) < <l>(u/\/n)f\''n. 

It follows that 

1*90 pOO pOO 

.96 / <t>{u/\^n) duj\/n < / FJ^ du < I t/>(u/\/n) dufy/n^ 

If one now considers a set of random samples of size 4, the last integral on the 
light represents the expected proportion of the set which falls into the sub-set A 
for which u>u' (and hence with deviation of mean relative to standard devi¬ 
ation of mean greater than w'/ y/'n) , The middle integral represents the expected 
proportion of the original set which falls into a sub-set B for which u > u* and 
Xi > 0. It follows from the inequality on the left that the expected proportion 
of the sub-set A which falls into the sub-set B is greater than 95 per cent. Hence 
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one infers that the probability is greats than .05 that for a samj^ showing sach 
a large deviation from the mean (w/Vn * 4, n * 4) aH the constitueiit eloo^t^ 
will have deviations on the same side of the population mewi. Uma if aU the 
elements of the sample investigated are found to have deviations On tite same 
side of the population mean, this could not be construed as addititmal eoiden^^ 
that the sample indicated an abnormal condition. 

This conclusion is weaker than the facts of the example warrant, since it is 
based upon the integral of Fn{u) from w' to infinity. Unfortunately the author 
does not have data available on the rate of convergence of these integrals. 


NOTE ON A MATRIC THEOREM OF A. T. CRAIG 

By Harold Hotelling 
Columbia University 

An extremely elegant theorem given recently by A, T. Craig^ and applied 
by him to establish a further theorem on independent x distributions may be 
stated as follows: 

If A and B are the symmetric matrices of two homogeneous quadraiic forms in n 
variates which are normally and independently distribiUed with zero means and uni! 
variances^ a necessary and sufficient condition for the independence in probability 
of these two forms is that AB = 0. 

The proof given that the condition is sufficient is adequate, but Craig’s 
treatment of its necessity consists essentially in its assertion. In view of the 
growing interest in such quadratic forms, for example in connection with serial 
correlation, the neatness of this theorem is likely to lead to a wide usefulness. 
It therefore seems worth while to give a complete proof of the necessity condition. 

The form with matrix A is denoted by Qi and that with matrix B by Qj. 
The characteristic functions, if defined as and are respectively the 

reciprocals of the square roots of the determinants of the matrices 1 — XA and 
1 — fjiB^ while the characteristic function for Qi and O 2 together, 
is the reciprocal of the square root of the determinant of 1 — XA — mB. A 
necessary and sufficient condition for independence is therefore that 

|1-XA|-|1-mB|s|1-XA--a*B| 

shall hold identically for all values of X and y. Since the determinant of the 
product of two matrices is the product of their determinants, the left member is 
the same as 

I 1 — XA — /iB + XpAB I . 

From this it is immediately obvious that AB = 0 implies the independence of 
the two forms. The converse will now be proved. 

1 “Note on the independence of certain quadratic forms,** Annals of Math, Stat.f Vol. 14 
(1943), pp. 195-197. 
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We shall assume therefore that Qi and Qs are independent, so that the identity 
holds, and prove that AB « 0 . 

It must not be supposed that Qi and Qs can by the same linear transformatimi 
be reduced to forms in which product terms are absent and only terms in the 
squares of the variates appear. The available theorems* leading to this canonical 
form require that at least one of the quadratic forms be non-singular. But it is 
of the essence of the present situation that both Qi and Qa be singular, since this is 
implied by AB « 0. It does not appear possible, for example, to reduce to this 
canonical form the pair x\ + xl and xl + 2 x 1 X 3 . 

Nevertheless a real orthogonal transformation can be found reducing Qi to 

dix* + • • • + drxj, 

where r is the rank of Qi, Thus there exists an orthogonal P such that A = 
PLP~~^ and B = PMP"^, where L and M, when partitioned so as to separate the 
rows and columns into successive groups of r and n — r, are of the forms 



Here D stands for an r-rowed diagonal matrix having di • • • dr in its diagonal, 
0 for various matrices whose elements are all zero, and E, C and F for arbitrary 
matrices of appropriate dimensions. Then 

I 1 - XA I = 1 P(1 - XL)P“' I = I P i • 1 1 - XL I • I p-“' I = I 1 ~ XL 1 , 

and in the same way, 

I 1 - mB i = I 1 ~ I 
and 

I 1 — XA — nB I = I 1 XL — fiM I . 

We thus have 

I 1 - XL I • fl - I = I 1 - XL - I . 

From this identity it follows that a pair of forms Q* and Q* , quadratic in a set of 
variates normally and independently distributed with zero means and unit 
variances, and having matrices L and M respectively, are independent. 

Since AB = PLMP~^^ the theorem will be proved if we can show that LM = 0. 
Let 



so that M = Ml + - 3 :^ 2 . Since obviously LAf 2 = 0 , we need only to show that 
LJIfi = 0. 


• E.g., B6cher, Iniroditciion to Higher Algebra, pp. 169,306. 
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let Q* >• <2^ + ivhere O' mi am quadratie fonns in i3ie ^artidular 
normally distributed variates conridered, and have matrices Mi and Mt lespee* 
lively. Since Q" obviously does not involve any of the vaiiatiBS oecuning hi 
Qi and since all the variates are independent it foUonvs that Q* isindepenctentirf 
Q'. Since it has been shown above that Q* is also independent Qt , it must be 
independent of tiie difference Q* — Q* = O'. Therefore 

u - Xi I • I 1 - i«Afi I * I 1 - XL -/Jfcfi I. 

We have; 


r 


U-M<l “ Ila-Xd,). 

M 


Also, 


1 _ XL - /lAfi 


1 

9 

1 

wH 

-mCI 

L -mC' ! 

_J 


Consequently equating the terms of highest degree in X on the two sides of the 
identity 


n(l - \di) I 1 - mMi I a I 1 - XL ~ mMi I 

yields the identity in y., 


u - mMi I S 1 , 


or upon putting y = 1/x, 


I ilfx - a: 1 = i-x)\ 


Hence all the latent roots of the real symmetric matrix Mi are zero. Now for 
a symmetric matrix the sum of the squares of the latent roots equals the sum 
of the squares of the elements, since both equal the trace of the square of the 
matrix. Therefore Mi = 0. Consequently Lilfi = 0 and the theorem is 
established. 

Since JJfi = 0 , the following further result is obvious: 

Two independent quadratic forms in a set of variates normally and independently 
distributed with zero means and a common variance can hy a transformation be 
reduced to two forms having no variate in common. 

But one of the disjunct sets of variates in the forms as thus reduced is not 
necessarily independent of the other set. For example, if xi, a:s, , 0:4 are 

normally distributed with equal variances and any fixed non-vanishing correlar 
tion, the same in each of the six pairs, the sets {xi , x^) and {xz , x^ are not inde¬ 
pendent of each other, but the forms {xi -H x^f and (xz — are, since Xi-]r x% 
is uncorrelated with Xz — Xi . 

I am indebted to Professor E. R. Lorch in connection with the preparation 
of the present note. 
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A NOTE ON THE BSHRBNS-FISHER FROBIEM 

By Henry ScheffA 
Syracuse Univenity 

A commonly occurring problem of statistical inference is the comparison of 
the means of two normal universes when the ratio of their variances is unknown. 
Let (a;i, * * * , xj) be a random sample from one normal population with mean a 
variance n, and (yi, • • • , Vn), a random sample from another with mean p 
and variance v. The problem is then that of making statistical inferences 
about the difference B of the means, B — a — when the ratio n/vis unknown. 
Convenient tests and confidence intervals are available if one can find a linear 
form L and a quadratic form Q in the vector (xi, • • • , , yi, • • • , Vn) with 

coeflSicients independent of the unknown parameters a, /3, /i, Vy such that for some 
positive integer the quotient 

(1) (L - S)/(Q/k)* 

has the /-distribution with k degrees of freedom. For this it is sufficient that the 
following conditions be satisfied for all values of the parameters: (i) L and Q are 
independently distributed, (m)^ E(L) = B, (iii) Q/a has the x-^istribution with k 
degrees of freedom, where e is the variance of L, 

In a recent paper [1] the author investigated the Behrens-Fisher problem as de¬ 
limited by the above three conditions,* and among other results arrived at a simple 
solution. This solution however does not have the property that the quotient 

(1) is symmetric, whereby in this note we shall mean the following: A function 
of the samples and parameters will be called symmetric if it is invariant under 
permutations of the x^s among themselves and of the y*8 among themselves. 
Let us therefore formulate condition (it;): the quotient (1) is symmetric. Since 
(it;) would be extremely desirable, both for practical and theoretical reasons, 
and since the author has received several inquiries on this matter, it is considered 
worth while to outline a proof that conditions (ii) and (iii) imply that (it;) cannot 
be satisfied, in other words, there exists no ‘‘symmetric solution” of the Behrens- 
Fisher problem within the framework we have imposed. Perhaps this is a 
simple example of a larger class of problems in which an approach, natural in the 
light of past developments, forces us to an asymmetric solution. 

Suppose (it;) is satisfied. By substituting special values for the vector (xi , 

• * * > , 2 / 1 , • * • , 2/n) and then making permutations allowed by (iv) we find 

that L and Q must be of the form 

(2) L = Cl + C2 X) yy, 

* 1 

(3) 0 c* 2 icj + C4 2) XiXif + c« X! y* -h Cc 2 Viyi* + c? 2 > 

i i 

1 E(f) denotes the expected value of /. 

* Although these conditions appear simpler than those in [1] they may be shown equiva¬ 
lent. 
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where all e’s ate indep^d^t of ptuameteni, and tlie range (tf indioee t, is 
from 1 to m, the range of from 1 to n. 

Condition (it) requires that 

(4) £(L)««-/3. 

But from (2), 

(6) £(L) = cttna + Cjn/8. 

Since (4) and (6) must be satisfied identically in «, p, it follows that ci «= 1/m, 
c» = — 1/n, and thus the variance of L is ' 

(6) <r* = li/m + y/n. 

Because of condition (Hi) we must have E(Q/a^) = k, and combining this with 

(6) , we have 

(7) E(Q) = fc(M/m + y/n). 

However, from (3), 

(8) E1(Q) = Cttfi(jn + a) + C 4 (m* — m)a* -|- c^(y -f- /3*) 

+ Ct(r^ — n)|8* + cmnaP. 

Equating (7) and (8) gives us an identity in a, p, n, v from which we can determine 
the c’s, and after putting these back in (3) we find that the result may be written 

(9) Q = A:[S,/(m* - m) + /Sy/(n* - n)], 
where 

-S* = 52 (*< - *)*» * = S Xi/m, 

i i 

-Sy “ 52 (Vi - yf, ® = Z yj/n. 

’ ’■ 

The last step of the proof consists of showing that Q defined by (9) violates 
(Hi). Write wi = S^/ft, «2 = SJy. Then Ui and t(* have independent x*- 
distributions. Now (9) states that u = Q/v* is of the form u = aiUi + , 

where oi and ot are constants. Let <!>({), <t>i(t), <h{t) he the respective characteristic 
functions of w, . Then ^(<) = ^(ait)<tn((hf) because ui and m* are statisti¬ 
cally independent. Since the characteristic function of a x**variable with r 
degrees of freedom is (1 — it is evident that u has a x*-distribution if and 

only if Oi = Os = 1. From (9), 

oi/o* = - m)]/[«'/(n® - n)], 

and thus a necessary condition (it is also sufficient) for Q/a to have a x*-dis- 
tribution is 


(10) 


n/y = (m* — m)/(n* — n). 
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But (Hi) states that Q/a* has a x*-diBtribution for oB parameter valim. This 
contradiction completes the proof. 

We remark in dosing that we have at hand a counter example of practical 
interest to the statement found in several statistics texts that if z is a normal 
variable with zero mean and v is an independent unbiased quadratic estimate of 
the variance of z, then z/v^ has a ^-distribution. The counter example consists of 
taking t - ^ ^ — h and v =» Q/k defined by (9). It may be shown that z/»* 

does not have a ^<listribution except in the trivial case (10). 

REFERENCE 

[1] H. ScheftS, AnnaU of Math, StaU, Vol. 16 (1943), pp. 36-44. 

ON MULTIPLE MATCHING WITH ONE VARIABLE DECK 

T. N, E. GEEViUiB 
Bureau of the Census 

The problem of card matching has been considered by a number of writers. 
A complete bibliography has been given by Battin [1], who also published the 
most general treatment of the subject to date, dealing with the simultaneous 
matching of any number of decks of arbitrary composition. He considers, 
however, only the case in which the order of every deck is variable, all possible 
permutations being equally likely. Some interest attaches to the case in which 
all the decks but one have fixed orders in relation to one another, especially in 
connection with radio experiments in telepathy, where a large number of subjects 
simultaneously attempt to call the same target. 

The simplest case is that in which the target for each trial is chosen at random, 
independently of the other trials. If the target is to be selected from s possi¬ 
bilities, and if p< denotes the probability that the iih. possibility will be chosen 
as the target, while m,* denotes the number of subjects who call the ith possibility, 
then the mean value of A, the number of correct calls is, of course, 

« 

( 1 ) , 

and the variance is 

( 2 ) = 

Evidently, the mean number of hits for a succession of trials is the sum of the 
means for the individual trials, and the variance is the sum of the variances. 

A slightly more difficult problem is presented when the target series is a true 
“deck’^ that is, when its composition is determined in advance, only the order 
being left to chance. Let n denote the number of trials and = 1, 2, • • • , «) 
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m 


tlie number of targets of the tth land so that = «• Also let ♦»<#(; « 1,2, 

• • •, n) denote the number of subjects who eaU the i'th possibiUty on the ith 
trial, and let m< denote the total number of such calls in all the n trials, so that 

n 

ffK » ^ mu . Then, using the ingenious type of counting function introduced 
by Wilks [6] and improved by Battin [1], we define 


*-5jg 


where AT = nl /n n*!. We also define an operator such that if w = u{xi , 

x7* in u. Then, if h 


X 2 , * • • , then KniU is the coefficient of Xi^xS^ 
denotes the number of hits 


P(h = r) = coefficient of e" in 


Also, 


Em = 


*-0 


It follows immediately that 

( 3 ) Mh fitmi = piMi , 

n M 

where p* is written for n,/r?. Similarly, it can be shown that 

Em = -E(niE mh) + , - Z rn,(ni - 1) E mutna 

H imml \ ]’mml / flijl ~ 1 ) »««1 | 




+ 


n(n 


i [-."i t 

L 


Mikinji 


1- 


j 


If we write X<, = E , it is found that 

1^1 

(4) F* = —TZ — mh - E PiPAnKii - m<m/) . 

n — 1 Li —1 1 J 

It should be noted that it is not necessary that the total number of subjects be 
the same for each trial. 

Certain special cases are of interest. When there is just one subject for each 
target, as in the ordinary matching of two decks, for i = j, and 0 other¬ 

wise; and Xirrn == Then, if vi denotes mi/n and pi = 1 — iri, and if ji = 1 — p<, 

(6) Mk = nSiPiTTi and Vh = ^ [S<p»g<ir<p< + ^aPiPj^irivily 

n — 1 ty? 
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a compact expresaon which is equivalent to the forms pireyiously given by 
Stevens [4,5], Greenwood [2], Battin [1], and the author [3]. 

On the other hand, if the different kinds of targets occur with equal frequency, 
so that every p< =* l/« and every = (« — l)/«; and if v* *= Sim* and v « 
SiWi = XkVk , the expression (4) becomes 

(6) Vk = I*'* ” + sXiinKti — m*)]. 
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Collected Papers, Edited by E. S. Pearson and John WtsHABT. 

Biometrika Office, London, pp. xiv + 224. 1942. 15 shillings. 

Since the question is still asked from time to time, it may be well to state that 
‘Student’ was the pen name of William Sealy Gosset. After taking bis degree 
in mathematics and natural sciences at Oxford, Student worked as a brewer for 
the well-known firm of Messrs. Guinness—^in Ireland from 1899 till 1936 and 
thereafter in London as head of the new Guinness brewery established there. 
He died in 1937 at the age of 61. Between 1907 and 1937 he published twenty- 
one papers and a few notes; these are issued in the present volume as a tribute 
from a group of his relatives and friends. 

Student’s name is almost universally attached to a single discovery, the ^-test, 
which requires for a complete proof more powerful mathematical methods than 
he devised. These facts may cause his papers to be regarded as museum pieces 
by people who have not read them. Actually, in many respects no better model 
than Student could be suggested for a young statistician today. Since we some¬ 
times speak derisively of courses of lectures in statistics where the methods and 
ideas are “20 years out of date”, it is interesting to find that many of the funda¬ 
mental ideas considered most in need of emphasis by forward-looking teachers 
today were in fact emphasized by Student in his writings over 30 years ago. 

For example, his classical paper on the f-test, published in 1908, opens as 
follows: “Any experiment may be regarded as forming an individual of a ‘popu¬ 
lation’ of experiments which might be performed under the same conditions. 
A series of experiments is a sample drawn from this population. 

Now any series of experiments is only of value in so far as it enables us to form 
a judgment as to the statistical constants of the population to which the experi¬ 
ments belong”. 

The idea is elaborated in a second paper published in the same year. “Note 
that the indefinitely large population need not actually exist. In Mr. Hooker’s 
case his sample was 21 years of farming under modem conditions in England, 
and included all the years about which information was obtainable. Probably 
it could not actually have been made much larger without loss of homogeneity, 
due to the mixing with farming tmder conditions not modem; but one can im¬ 
agine the population indefinitely increased and the 21 years to be a sample from 
this.” We note here a further observation, not always appreciated, that in some 
lines of research samples which are both large and homogeneous do not exist. A 
clear conception of the relation between sample and population is evident in all 
his work: one further quotation, the summary from his 1926 paper, Math^rrudics 
and Agronomyy will suffice. “To sum up, in planning agronomic experiments use 
plenty of replications and make quite sure your results are capable of being con¬ 
sider^ to be a random sample of the population about which you wish to draw 
conclusions.” 
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Another noticeable feature of Student’s work is his concern about the extent to 
which the mathematical model underlying his statistical techniques really repre¬ 
sents the facts of the data. Further, he is always ready to give his opinion, from 
considered thought, as to whether the discrepancies between the model and the 
facts will affect the conclusions materially. In his first paper, On the error of 
counting with a hacmacytorneter, he shows that the Poisson distribution applies to 
the total number of cells or corpuscles which are found in a divii^on of the hae- 
macytometer field. He is careful, however, to point out three wajrs in which 
deviations from the Poisson law may occur in practice: (i) there may be inter¬ 
ference between the particles, though this is considered unlikely in high dilutions 

(ii) there may be clumping and (iii) (probably the most important, in his view) 
the drops taken out for counting may not represent the bulk of the liquid, giving 
rise to an additional sampling error superimposed on the Poisson error. In a 
later paper An explanation of deviations from Poisson^s law in practice (1919)^ he 
makes a critical examination of the assumptions necessary for Poisson’s distribu¬ 
tion, and shows the type of distribution to be expected from the invalidity of one 
or another of the assumptions. 

Similarly, in the opening paragraphs of the 1908 paper on the <-test, he notes 
that the assumption of normality in the parent distribution may not hold in 
practice. Then follows his opinion on the importance of this assumption: *Tt 
appears probable that the deviation from normality must be very extreme to lead 
to serious error.” 

His earliest papers deal with attempts to find several exact small-sample dis¬ 
tributions for which he and other workers in Britain felt a need. Lacking the 
mastery of probability theory necessary for obtaining a rigorous solution, he used 
a combination of mathematics, intelligent guesswork and inferences from repeated 
samplings which he drew. For example, the steps in his development of the 
^-distribution were as follows: 

(i) The distribution of the mean square, being imknown to British statisti¬ 
cians at that time, had first to be found. Student calculated the first four 
moments by algebraic methods. Noticing that the condition for fitting a Pear¬ 
son Type III curve (2 ^32 — 3 iSi — 6 = 0) held exactly, he inferred correctly that 
«* followed a type III curve. 

(ii) lie was aware that the joint distribution of s* and the sample mean x was 
necessary for a solution of the problem and further that the form of the joint dis¬ 
tribution depended on the correlation between x and Having shown by alge¬ 
bra that is uncorrelated with either £ or x^, he assjumed (without further com¬ 
ment) that the distributions of x and were independent. 

(iii) The final step to the ^-distribution was performed rigorously. 

(iv) The result was then checked against empirical data. He used the heights 
of 3000 criminals, which provided 750 samples of 4. Another set of 750 samples 
was supplied by the lengths of the left middle fingers of the same criminals. 

In 1908 Student also published the distribution of the sample correlation 
coefficient r, in a bivariate normal population in which p is zero. His interest in 
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the problem arose from a meeting of the Royal Statistical Society, in which an 
inquiry was raised about the significance of correlation coefficients derived from 
small numbers of cases. The authorities present, Yule, Hooker and Edgeworth, 
ventured their opinion on a specific instance, but had no general answer. Ac¬ 
cordingly, Student set out to find the general distribution of r, without assuming 
p zero. His mathematics did not take him far; providing the solution only for 
samples of 2, where the frequency is zero except at —1 and •+-!. In order to 
learn what empirical results could teach about the form of the distribution, he 
then drew four sets of 760 samples, the sets having p — 0, n = 4;p = 0, n«!8;p 
= .66, n = 4; and p = .66, n = 8, respectively. 

He considered first the set having p = 0, n = 4. Now, for samples <rf 4, the exact 
distribution, C(1 — reduces to a rectangular distribution. For his 

purpose, it would seem that Student had been imlucky in his choice of n = 4, 
since the rectangular distribution appears to furnish little or no clue as to the 
general form of distribution when n differs from 4. Actually, things could not 
have turned out better. Referring, as in the case of the distribution of s*, to 
Pearson’s curves, he selected a type II on account of the limited range of the dis- 

( j? \0.272 

1 — ) • This sug¬ 

gested to Student that the true form ought to be CCl—r*)®, giving a rectangular 
distribution; the general form was guessed as C(1 — The samples of 8 

were then used merely as corroborative material. One wonders how Student 
would have fared, both in this case and in the case of 5 ^, if the Pearson system of 
curves had not been familiar to him. The method did not of course provide the 
much more complex distribution when p is not zero; however. Student was able 
to write down several important properties of the distribution for the guidance 
of mathematical statisticians who might be interested in the problem. 

By similar methods he attempted (1921) to find the frequency distribution and 
the efficiency of Spearman’s rank correlation coefficient. Much later (1936), he 
remarked in connection with this coefficient that, though it was not fully effi¬ 
cient, ‘‘it was so simple that when playing with other people’s figures, for instance 
on a railway journey, it was the obvious one to use.” 

His paper. The elimination of spurious correlation due to position in time and 
space, though brief and incomplete in its treatment of the problem, established 
him as one of the pioneers in the use of the variate-difference method. His sug¬ 
gestion was to correlate the nth differences of x with those of y for n = 1, 2, • • • 
until a point was reached where the correlation ceased to change. This limiting 
correlation would be free from spurious effects due to trend factors. Such 
methods, he remarks, help to show “whether there really is a close connexion 
between the female cancer death rate and the quantity of imported apples con¬ 
sumed per head.” 

His association with field experiments conducted in Ireland for the purpose of 
developing high yielding varieties of barley suitable for brewing led to an interest 
in the design of experiments, to which he devoted seven papers. His philosophy 
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on this subject, though it appeared to be increasingly out of harmony with the 
philosophy underlying Fisher’s methods, was a natural outgrowth of Student’s 
general attitude towards the applications of statistical techniques. He believed 
that, from general experience of soil fertility patterns plus local knowledge of the 
history and topography of the particular field in question, one could construct a 
systematic design which would be more accurate than the most appropriate 
randomized design. Secondly, although systematic designs rely on nature, as it 
were, to supply the random element necessary for the application of probability 
theory, he believed, again from knowledge of field conditions, that the discrepan¬ 
cies between the mathematical model and the true fertility pattern would not 
seriously vitiate the estimate of error. How far he would have carried the advo¬ 
cacy of systematic arrangements is not clear. Nearly all of his discussions refer 
to arrangements for comparing only two varieties. Moreover, he stressed re¬ 
peatedly the necessity for carrying out such trials over several seasons and at a 
considerable number of places, in order to sample weather and soil variations. 
In such cases the statistical significance of a result at a single place did not interest 
Student greatly. 

A few quotations of his opinions on more detailed questions illustrate his under¬ 
standing of the attitude of the farmer and agronomist and his direct mode of 
expression. Speaking of experimentation with large plots, he says “it has the 
advantage that the farmer, who always has a healthy contempt for gardening, 
may pay some attention to the results.” On adjustments to yield data to allow 
for variations in soil fertility, he writes: “There is a great disadvantage in cor¬ 
recting any figures for position, inasmuch as it savors of cooking, and besides the 
corrected figures do not represent anything real.” Long experience had made 
him more and more convinced that a serial correlation is inevitable in routine 
chemical analyses; accordingly, he advises: “The chemist who wishes to impress 
his clients will therefore arrange to do repetition analyses as nearly as possible at 
the same time, but if he wishes to diminish his real error he will separate them by 
as wide an interval of time as possible,” 

Re-reading of Student’s papers has been a keen pleasure. It is hoped that the 
volume will enjoy a wide circulation amongst statisticians. 


W. G. Cochran 



NEWS and notices 

Readers are invited to submit to the Secretary of the Institute news ifems of interest 

Personal Items 

Dr. P. H. Anderson, formerly of War Production Board in Cleveland, is now 
associated with the Bureau of Foreign and Domestic Commerce in Washington, 

D. a 

Dr, T. A. Bancroft, formerly instructor at Iowa State College, has been pro¬ 
moted to an assistant professorship. 

Jarvis Barnes is now Supervisor of Statistics for the Atlanta Board of Educa¬ 
tion. 

Associate Professor W. D. Baten has been promoted to the rank of Professor 
of Mathematics and Research Professor at the Michigan Agricultural Experi¬ 
ment Station of Michigan State College. 

Isaac L. Battin is now Assistant Professor of Mathematics at Drew Uni¬ 
versity. 

Robert E. Bechhofer has been promoted from Assistant Statistician to Associ¬ 
ate Statistician at Al)erdeen Proving Ground. 

Blair M, Bennett is now Associate Mathematician with the National Bureau 
of Standards at Washington. 

Assistant Professor Thomas A. Bickerstaff on leave from the University of 
Mississippi, is in attendance at the University of Michigan. 

Dr. Ernest E. Blanche, formerly with Curtiss-Wright, is now Principal Statisti¬ 
cian, Chief of Operations with Statistics Division, Foreign Economic Administra¬ 
tion. 

Assistant Professor Frances L. Campbell, on leave from her position as Dean 
of Women at George Pepperdine College, is studying at the University of 
Michigan. 

Professor Haskell B. Curry, on leave from Pennsylvania State College, is now 
Mathematician at the Applied Physics Laboratory, Johns Hopkins Univer¬ 
sity. 

Dr. A. H. Copeland of the University of Michigan has been promoted to a 
professorship. 

Dr. Gerald J. Cox, formerly with the National Research Council, is now 
Research Chemist with Com Products Refining Company, Argo, Illinois. 

Assistant Professor Paul Dressel of Michigan State College has been appointed 
Chairman of the Board of Examiners and Director of Counselling. 

William F. Elkin has accepted a position as Social Science Analyst, Vital 
Statistics Division, Bureau of the Census at Washington. 

Mark W. Eudey, on leave from the University of California, is serving as 
Operations Analyst with the Ninth Tactical Air Command. 
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Associate Professor Herbert P. Evans has been promoted to the rank of Pro¬ 
fessor of Mathematics at the University (rf Wisconsin. 

Raymond H. Fadner, formerly with the OCD, is now Statistician for the 
United Nations Relief and Rehabilitation Administration. 

Dr. Nicholas A. Fattu is now with the College Entrance Examination Board 
at Princeton. 

Evelyn Fix is now Lecturer in Mathematics and Research Assistant at the 
Statistics Laboratory of the University of California. 

Assistant Professor Harold A. Freeman has been promoted to the rank of As¬ 
sociate Professor of Statistics at Massachusetts Institute of Technology. 

Milton Friedman, formerly with the Treasury Department, is now with the 
Columbia University Statistical Research Group. 

E. L. Godfrey is now instructor in Mathematics and Physics at the Northwest 
Missouri State Teachers College at Maryville. 

Harry H. Goode, formerly Statistician with the New York Department of 
Health, is now Research Associate in Mathematics at Tufts College. 

Mrs. Dorothy K. Gottfried, formerly Assistant Statistician with the OPA, 
is now Research Analyst with the United States Army Signal Corps at Newark. 

Trygve Haavelmo, formerly with the Norwegian Shipping and Trade Mission, 
is now Commercial Secretary at the Norwegian Embassy at Washington. 

M. H. Henry, formerly Assistant Statistician with the Michigan Department 
of Social Welfare, is now Cost Analyst for the Fisher Body Corporation at 
Detroit. 

Dr. Robert W, B. Jackson has been promoted to the rank of Assistant Profes¬ 
sor and Assistant Director of the Department of Educational Research at the 
Ontario College of Education. 

Dr. Rachel M. Jenss, formerly Economic Analyst with the Treasury Depart¬ 
ment, is now with the State Department in the capacity of Assistant Chief, Cost 
of Living Division, Foreign Service Administration. 

Dr. Irving Kaplansky is Assistant Research Mathematician at Columbia 
University. 

Leo Katz, formerly Statistician with the Michigan Department of Labor and 
Industry, is now with the Aircraft Development Section of General Motors 
Corporation at Detroit. 

Lila F. Knudsen is now Statistician with the Food and Drug Administration. 

Dr. Richard L. Kozelka has been promoted to the rank of Professor of Eco¬ 
nomics and Statistics and Acting Dean, School of Business Administration at the 
University of Minnesota. 

Dr. George M. Kuznets has been promoted to the rank of Assistant Professor 
of Agricultural Economics at the University of California. 

Max LeLeiko, formerly at the University of North Carolina, is now an in¬ 
structor at Rutgers University. 

Assistant Professor M. S. Macphail of Acadia University has been promoted 
to an associate professorship. 



Dr. Wifiiam G. Madow, formerly with the Bureau of Agriehitural Boouomioct, 
is now Statistieian with the Census Bureau. 

Dr. Benjamin Malzberg is now Director of the Bureau of Statistics, New York 
State Department of Mental Hygiene. *; 

Dr. Henry B. Mann has taken a position as Research Associate at Ohio State 
University. ' 

Dr. John W. Mauchly has been promoted to the rank of Assistant Professor 
of Electrical Engineering at the University of Pennsylvania. 

Dr. R. V. Mises of Harvard University has been elected to membership in the 
American Academy of Arts and Sciences. 

Frederick C. Mosteller is now Research Associate at Princeton Univeisaty. 

Franklin S. Nelson has been promoted from Assistant Statistieian to A^ociate 
' Statistician at Picatiimy Arsenal. 

Dr. Edward B. Olds, formerly Statistician with Curtiss-Wright, is now Eco¬ 
nomic Analyst with the United Nations Relief and Rehabilitation Administra¬ 
tion. 

Assistant Professor Charles K. Payne has been promoted to an associate pro¬ 
fessorship of mathematics at New York University. 

Paul Peach, formerly with United States Rubber Company, is now Quality 
Control Consultant with the War Production Board. 

Associate Professor Phillip J. Rulon has been promoted to the rank of- Pro¬ 
fessor of Education and Acting Dean of the Harvard Graduate School of Educa¬ 
tion. 

Assistant Professor Paul A. Samuelson has been promoted to the rank of 
Associate Professor of Economics at Massachusetts Institute of Technology. 

Miss Marion M. Sandomire, formerly with the War Department, is now 
Statistician with the Navy Department, Bureau of Ships. 

William C. Shelton, formerly with the Department of Commerce, is now 
Senior Mathematical Statistician with the Navy Department. 

Blanche Skalak, formerly connected with Wilson Mechanical Instrument 
Company, is now Statistician with the Census Bureau. 

Associate Professor L. Edwin Smart has been promoted to the rank of Professor 
of Economics at Ohio State University. 

Dr. L. V. Toralballa, who has been a special instructor at the University of 
Michigan during the past year, is now at Princeton University for post-doctoral 
study. 

Afflistant Professor Abraham Wald has been promoted to Associate Professor 
of Statistics at Columbia University. 

John F. Wyckoff has been promoted to Assistant Professor of Mathematics at 
Trinity College, Hartford. 

Ruth Zwerling, formerly with the New York City Employees Retirement 
system, is now Research Assistant with the Milbank Memorial Fund. 
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New Member! 

The following persons have been elected to membership in the Institute; 

} ' ■ 

Bttdne, HtdmaB A. M.A. (State Teachers Coll. Montclair) Asst. Prod. Mgr., Atlas Air¬ 
craft Prod, porp., 405 E. 46 St., New York, N, Y. 47^ JerBey Staten Islandf N, Y, 
Darkow, Marguerite D. Ph.D. (Chicago) Asso. Prof, of Math., Hunter Coll., New York, 
N. Y. le E. 8B St, New York, 88, N. Y, 

STerii Dillon Ph.D. (Iowa) Captain, Ord. Dept., Asst. Army Inspector of Ordnance, St, 
Louis Ord. Plant, 4300 Goodfellow Blvd., St. Louis, 20, Mo. 

Hlntermaler, John C. Grad. (Phil. Textile Inst.) Supervisory Chem. Forstmann Woolen 
Co., 2 Barbour Ave., Passaic, N. J. M Marilyn Place, Clifton, N, /. 

Katz, Amron H. B.A. (Wisconsin) Asso, Physicist, Chief, Shutter Unit, Aerial Photo 
Lab., TSELA-4D, Wri^t Field, Dayton, Ohio, 

Lavln, Mftrvln M. M.S. (Chicago) Stat. Corning Glass Works. 80 E. Second St., Corning, 
New York. 

Levine, Harriet A.B. (Hunter) Asst. Math. Stat., Stat. Res. Group, Columbia Univer¬ 
sity, New York, N. Y. S09 W, 99th St, New York, 86, JV, Y. 

Littauer, Sebastian B. D.Sc. (Mass. Inst. Tech.) Res. Eng., Bendix Radio Div., Bendix 
Aviation Corp., Towson, Md. Box 6797, Towson, 4, Md, 

Pastore, Nicholas M.S. (C. C. N. Y.) Teacher, St. John’s Prep: School, Brooklyn, N. Y. 
8944 Hadcliff Ave., New York, N. Y. 

Rodrigues, Milton da Silva Dr. (Brazil) Prof, of Stat., Univ. of Sao Paulo, Brazil. Inter¬ 
national House, Rm. S8S, 600 Riverside Drive, New York, 87, N. Y. 

Rothbard, Murray Senior (Columbia) 370 Central Park West, New York, N. Y. 

Rubin, Ernest M.A. (Columbia) Res. Analyst, U. S. Dept, of Justice, Immigration & 
Nat. Service, 7104 Emerson Ave., Upper Darby, Pa. 

Sousa, Alvaro Pedro de B.E. (Liverpool) Vice-Governor, Banco de Portugal, Monserrate, 
Rua Infante de Sagres, Estoril, Portugal. 

Wlrick, Grover C., Jr. A.B. (Mich State) Sec. Stat. Res. Dept., Milwaukee Community 
Fund & CJounc. of Soc. Agencies. 3206 W. Wisconsin Ave., ^C-8, Milwaukee, 8, Wis. 

Report on the Third Meeting of the Pittsburgh Chapter of the Institute 

The third meeting of the Pittsburgh Chapter of the Institute of Mathematical 
Statistics was held at Carnegie Union, Carnegie Institute of Technology, on 
Saturday, March 18, 1944. Twenty-five persons attended the meeting, includ¬ 
ing the following four members of the Institute: 

G. G. Eldredge, H. J. Hand, F. G. Norris and E. M. Schrock. 

At the morning session, Mr. E. M. Schrock, Aberdeen Proving Ground, Aber¬ 
deen, Md., presented a paper ‘‘Some Fundamental Aspects of the Shewhart 
Quality Control Chart”. At the afternoon session, Mr. W. H. Thomas, National 
Tube Company Research Laboratory, presented an address “Analysis of Variance 
and Experimental Design in Metallurgical Research”. Mr. F. G. Norris, 
President of the Pittsburgh Chapter, acted as chairman. 

H. J. Hand, 

Secretary-Treasurer of the Pittsburgh Chapter. 

Report on the Fourth Meeting of the Pittsburgh Chapter of the Institute 

The fourth meeting of the Pittsburgh Chapter of the Institute of Mathematical 
Statistics was held at Carnegie Union, Carnegie Institute of Technology on 



Monday ovaning^ Octoba 2,1944. TiMy-fdur persona attended the niyeeltnti' 
including the fdlowihg four members of ^e In^itute: ^ 

J. y. Sturtevant, 3. Mwuele, R, F. Passano, H. J. Hand, 
lieut, J. H. Curtiss, USNR, Bureau of Ships, Washington, D. C., presented,: 
a pap«r on the “Rational Sampling in the Iheparation of Specifications”. 

R. F. Passano, Bethlehem Steel Company, Bethlehem, Pennsylvania, acted as' 
chairman for the meeting. 

H. J. Hajid, 

^scretary^Treasurer of the PitUhurgh CAopter. 



DIRECTORY OP THE INSTITUTE OF MATHEMATICilL STATISTICS* 

(As of November 1, 1944) 

(The jiames of Fellows of the Institute are designated by *) 

Acerboni, Pr<rf. Argentine V. Dr. Ec. (Buenos Aires) Pacultad de C. Economioas, Buenos 
Aires, Argentina. Larroque BSB, Banfield, Argentina, 

Alien, Roy G. D.Sc, (London) Combined Production and Resources Board, Soc. Se¬ 
curity Bldg., Washington, D. C. 1806 Shepherd SL^ N,W,, Washingionf 11, D, C. 

Alt, Pranz L. Ph.D. (Vienna) U. S. Army. $71 Fort Washington Ave,, New York, 8$, 
N, Y. 

Alter, Dinsmore Col., CAC, c/o A.T.S., Fort Mason, San Francisco, Calif. 

Anderson, Paul H. Ph.D. (Illinois) Stat. Analyst, Distribution Mgt. Unit, Bur. of 
Foreign & Domestic Com., Washington, 25, D. C. 

Anderson, Richard L. Ph.D. (Iowa State Coll.) Res. Math., Princeton Univ., Fine Hall, 
Princeton, N. J. 

Anderson, Theodore W*, Jr. M.A. (Princeton) Res. Math., Princeton Univ., Fine Hall, 
Princeton, N. J. 

Andrews, T. Gaylord. Ph.D. (Nebraska) Instr. Barnard College, Columbia Univ., New 
York, N. Y. 

Angell, Dorothy T. Stat. Analyst, Bell Tel. Labs., Murray Hill, N. J. 

Arias, B., Jorge C.E. (Guatemala) 3 Avenida Sur 65, Guatemala City, Guatemala, 
Central America. 

Arnold, Asso. Prof. Herbert E. Ph.D. (Yale) Wesleyan Univ., Middletown, Conn. 

Arnold, Kenneth J. Ph.D. (Mass. Inst. Tech.) Sen. Math. Stat., Stat. Res. Group, 
Columbia Univ., 401 W. 118th St., New York, 27, N. Y. 

Aroian, Leo A. Ph.D. (Michigan) Instr. Hunter Coll., New York, N. Y. $4^^ Wadsworth 
Ave,, New York, 88, N, Y, 

Arrow, Kenneth J. M.A. (Columbia) Ist Lieut., A. C. Weather Div., Ac/As, O.C. & R., 

, Hdqtrs. AAF, Pentagon Bldg., Washington, D. C. 141 O Nicholson St., N,W,, Wash¬ 
ington, 11, D.C, 

Bachelor, Robert W. M.B.A. (Washington) American Bankers Assn., 22 East 40th St., 
New York, 16, N. Y. 

Bacon, Asso. Prof. Harold M. Ph.D. (Stanford) Stanford Univ., Stanford, Calif. Box 
1114- 

Bailey, Arthur L. B.S. (Michigan) Stat., American Mutual Alliance, 60 E. 42nd St., 
New York, N. Y. P, 0, Box 278, Ramsey, N, J. 

*Baker, Asst. Prof. George A. Ph.D. (Illinois) Asst. Prof, of Math, and Asst. Stat. in 
the Exp. Sta., College of Agriculture, Univ. of California, Davis, Calif. 

Baldwin, Woodson W. S.B. (Mass. Inst. Tech.) Capt., Ord. Dept., U. S. A., Office of 
Field Dir. of Ammunition Plants, 3629 Lindell Blvd., St. Louis, 8, Mo. 8746 Lindell 
Blvd, 

Bancroft, Asst. Prof. Theodore A. Ph.D. (Iowa State ColL) Iowa State Coll., Math. 
Dept., Ames, Iowa. 

Barnes, Jarvis M.A. (George Peabody Coll, for Teachers) Atlanta Board of Educ. 14th 
Floor, City Hall, Atlanta, Ga. 

* Members were asked to supply fresh information for this Directory, Records may be 

inexact or incomplete because of failure of some members to comply with this request. 

Changes in addresses, or errors in names, titles or addresses, should be reported to the ^cre- 

tary. 
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Bmefti Jolm t* Ph,D. (Princeton) Tech, Staff, Bell Tel, l^be,, ^ St,, New Yot4, 
N. y. WE, River Rd,, Rvmean, N,'J. 

Barr, Prof. ArvU S. Ph.B. (Wisconsin) Univ. of Wisconsin, Madison, Wis# 
Barnd-Sotito, Prof. Jos^ Sc.D. (Buenos Aires) Univ. of Buenos Aires, Buenos Aires, 
Ai^ntina. Cordoba IJ^B, 

*Bartky, Asso. Pean Walter Ph,D. (Chicago) Univ, of Chicago, Chicago, BL 
Bassford, Horace R, B.A. (Trinity Coll,) Vice Pres, and Actuary, Metropolitan life 
Ins. CJo., 1 Madison Ave,, New York, 10, N, Y. 

^Baten, Pr<^. William D. Ph.D. (Michigan) Prof, of Math. Mich. State Coll, and Res. 
Prof. Mich. Agri. Exp. Sta., Mich. State Coll., E. Lansing, Mich. 4 tl Marekall Si,, 
E, LaneinQf Mich, 

Bates, Prof. O. Kenneth Sc.D. fMass. Inst. Tech.) Prof, of Math, and Head of Dept., 
The St. Lawrence Univ., Canton, N. Y. 

Battin, Asst. Prof, Isaac L. A.M. (Swarthmore) Drew Univ., Madison, N. J. 14 Glen» 
wild Rd,f MadisoUf N,J, 

Beal), Geoffrey Ph.D. (London) Asst. Entomologist, pominion Entomological Lab., 
Chatham, Ont., Can. 

Bechhofer, Robert £. A.B. (Columbia) Asso. Stat. (Ordnance) Ordnance Res. A 
Development Center, Arms & Ammunition Div., Aberdeen Proving Gd., Md. Box 
21 y Aberdeeriy Md, 

Becker, Harold W. Elec. Inst., Mare Is. Training School, Bldg. 146, Mare Island, Calif. 
1426 Amadory Vallejo, Calif, 

Beebe, Lieut. Gilbert W. Ph.D. TColumbia) Sn. C., A. U. S. Control Div., Office of the 
Surgeon General, 1818 H St., N.W. Washington, D. C. 

Been, Richard O. M.A. (George Washington) Sr. Agric. Econ., U. S. Bur. of Agric. 

Economics, 3433 South Bldg., Washington, D. C. 

Bellison, Harold R. S. B., S. M. (Mass, Inst. Tech.) Industrial Eng., War Dept., Ord. 

Dept., Pentagon Bldg., Arlington, Va. S416 B St,, S,E,, Washington, 19, D, C, 

Belz, Asso. Prof. Maurice H. M.A. (Melbourne) Univ, of Melbourne, Carlton, N. 3, 
Victoria, Australia, 

Bennett, Prof. Albert A. Ph.D. (Princeton) Brown Univ., Providence, R. I.; Maj., 
Ord. Dept., Ballistics Res. Lab., Aberdeen Proving Gd., Md. 

Bennett, Blair, M. M.A. (Columbia) Asso. Math. Nat. Bur. of Standards, Washington, 
D. C. 1410 M St,, N,W.y Washington, D, C, 

Bennett, Carl A. A.M, (Michigan) ^ Stat. General Del, Oak Ridge, Tenn, 

Berger, Richard M.A. (Columbia) Ensign, U. S. Navy, Rugby Rd,, Rockville Centre, 
N. Y, 

Berkson, Joseph M.D., D.Sc. (Johns Hopkins) Col. M. C., U. S. Army, Army Air Forces, 
Washington, D. C. 

Bickerstaff, Asst. Prof. Thomaa A. M.A. (Mississippi) 709 Haven St,, Ann Arbor, Mich, 
Bimbaum, Asst. Prof. Z. William Ph.D. (Lwow) Univ, of Washington, Seattle, Wash. 
Blackadar, Walter. L. B.A. (McMaster) Asso. Actuary, Equitable Life Assurance 
Society of the U. S. S93 7th Ave,, New York, 1, N,Y, 

Blackburn, Asso. Prof. Raymond F. Ph.D. (Pittsburgh) Head, Dept, of Stat., Univ, 
of Pittsburgh, Pittsburgh, 13, Pa. 

Blake, Archie Ph.D. (Chicago) Eng. Natl. Inventors Council, Washington, D. C. 2812 
Cathedral Ave,, N,W,, Washington, D, C. 

Blanche, Ernest E. Ph.D. (Illinois) Prin. Stat., Chief of Operations, .Foreign Econ. 
Admin., Stat. Div., 515-22nd St., N. W., Washington, D. C. $228 Martha Custie 
Drive, Parkfairfax, Alexandria, Va, 

*Bll8S, Cheater I. Ph.D. ((Ik)lumbia) Biometrioian, Conn. Agri. Exp. Sta.; Lecturer in 
Biometry (Asso. Prof.), Yale Univ,, New Haven, Conn. Box 1106, 

Bloom, Rose B.A. (Hunter Coll.) Private, 1263 SCSU, Mason GH, Brentwood, Long 
Island, N. Y. 
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Bonii, Attstiii J. E.8. (C. C. N. Y.) Major, G-I War Dept* Gen. Staff, W$aliington, D« C« 
BSOO Que St., N.W., Washington, D. C. 

Bonnar, Robert U. M.S. (Washington) Asso. Chemist, Bur. of Ships, Navy Dept., Wash* 
ington, D. C. 4.14 Whitestone Rd,, Silver Spring, Md. 

Booaer, Mary £• A.M. (Chicago) Stat. Res., Virginia State Planning Bd«, 301 Finance 
Bldg., Richmond, 19, Va. 

BcMlchan, I%til Ph.D. (Vienna) Econ. Inst., 500 Fifth Ave., N. Y. 104 W. 40th St., 
New York, 18, N. Y. 

Bower, Oliver K. Ph.D. (Illinois) Associate, Univ. of Ill., Urbana, Ill. 60d W, John, 
Champaign, JIL 

Bowker, Albert H. S. B. (Mass. Inst. Tech.) Asst. Dir. Stat. Res. Group, Div. of War. 
Res., (Columbia Univ., New York 27, N. Y. 

Brady, Dorothy S. Ph.D. (California) Home Ec, Specialist, Bur. of Home Econ., Wash¬ 
ington, D. C. 44^4 First Place, N.E,, Washington, D. C. 

Brandt, Alva £. Ph.D. (Iowa State Coll.) Tech. Observer, USAAF. Box 89, Route S, 
Vienna, Va. 

Breorty, Charles R. B.S. (California) Major, Signal 0>rps, Signal Corps Inspection 
Agency, 19 West 4th St., Dayton, 2,0. 

Breden, Robert £• B.S. (Kansas State) Personnel Tech., Tech. Section, War Dept., 270 
Madison Ave., New York, N, Y. 9204 77th St., Jackson Heights, L. N. Y. 

Brldger, Clyde A. M.S. (Oregon State Coll.) Instr., Univ. of Utah, Salt Lake City, 1, 
Utah. SS2 Douglas St,, Salt Lake City, 2, Utah. 

Brlzey, Asso. Prof. John C. Ph.D. (Chicago) Univ. of Oklahoma, Norman, Okla. 927 
S. Pickard St., Norman, Okla. 

Bronfenbrenner, Martin Ph.D. (Chicago) Lieut. (J.G.) U. S. Navy Reserve, Office of 
CINCPAC, c/o Postmaster, San Francisco, Cal. 728 N. First Ave., Tucson, Ariz. 

Brookner, Ralph J. Ph.D. (Columbia) Lieut. USNR, Midshipmen’s School, 265 River¬ 
side Drive, New York, 25, N. Y. 

Brooks, Alvin G. B.A. (Ripon Coll.) Chief of Inspection Tasks Sec., Western Elec. Co., 
Hawthorne Sta., Chicago, Ill. 4dS8 Lawn Ave., Western Springs, III. 

Brown, Asst. Prof. Arthur B. Ph.D. (Harvard) Queens Coll., Flushing, N. Y. 

Brown, Arthur W. A.B, (Princeton) Res. Asso., Columbia Univ., Div. of War Res., New 
York, N. Y. Columbia Res, Croup M, Room 4^11, COMINCH, Navy Dept,, Washing¬ 
ton, D. C. 

Brown, George W. Ph.D. (Princeton) Res. Eng., RCA Labs., Princeton, N. J, 

Brown, Richard H. A.B. (Columbia) Asso. Math., Navy Dept., Bur. Ord., Washington, 
D. C. Rm 816-1 84I6 S8th St., Washington, 16, D. C. 

Brumbaugh, Prof. Martin A. Ph.D. (Pennsylvania) Univ. of Buffalo, Crosby Hall, 
Buffalo, 14, N. Y. 

Broyere, Martha, (Mrs. Paul T.) M.D. (Chicago) Stat. U. S. Public Health Service, 
Bldg. T6, Bethesda, Md. R. F, D. Rt. if, 1, Gaithersburg, Md. 

Bnsyere, Paul T. M.P.H. (Yale) Stat. U. S. Public Health ^rvice. Bldg. T6, Bethesda, 
Md. R. F. D. Rt. ifl, Gaithersburg, Md. 

Bryan, Joseph Ed.M. (Harvard) Mass. Inst, of Tech., Cambridge, Mass. Apt. 508,1010 
26th St., N.W., Washington, D. C. 

Budne, Thomas A. M.A. (N. J. State Teachers Coll.) Asst. Production Mgr., Atlas Air¬ 
craft Products Corp., 405 E. 42nd St., New York, N. Y. 

Burgess, Robert W. Ph.D. (Cornell) Chief Economist, Western Electric Co., 195 Broad¬ 
way, New York, 7, N. Y. 

Burk, Marjorie Asso., Stat., Meteorology, Weather Div., AAF, Washington, D. C. 

Buros, Oscar K. M.A. (Ck>lumbia) Major, Sig. Corps, Chief, Standards Sec., School 
Div., Army Service Forces, Washington, D. C. 801S. Courthouse Rd., Arlington, Va, 

Burr, Asst. Prof. Irving W. Ph.D, (Michigan) Purdue Univ., W. Lafayette, Ind. 
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]3iitli«yi Alto. Prof. J« iptobaH; Ph.D. (Miol^gaii) Hunter Coll. 695 Fork'Ave.^ New 
York,21,N.Y. 

*Caiup, Prof. Burton H. Ph.D. (Yale) Wesleyan Univ., Middletown^ Ck>m, HO Mi. 
Vernon SU 

Campbell, Frances L. M.A. (California) Asst. Prof, of Math. A Dean of Women, Oeoi^e 
Pepperdine CoU., Los Angeles, Calif. Siockwell Hull, 1^1049^ Ann Arbor, Mich. 

Campbell, George C. M.S. (Iowa) Major, U. S. Army, Pine Bluff Arsenal, Arkansas. 

Cap6, Bcjmardo G. Ph.D. (Cornell) Biometrician, Agri. Exp. Sta., Rio Piedras, Piferto 
Rico. $9i Rosario Si., Santurce, San Juan, P. R. 

Carlson, John L. M.A. (Stanford) Lt. Comdr., USNR, VRIO-Fleet P. O., San Fran* 
cisco, Calif. 

Carlton, A. George B.A. (Gustavus Adolphus) Stat., Applied Physics Lab., Johns 
kins XJniv., Silver Spring, Md. 409 S4th St., N.E,, Washington, D. C, 

Carter, Gerald C. Ph.D. (Purdue) Head Instr., Naval Training ^hool, Purdue ITniv., 
Lafayette, Ind. 6 S0 Garfield, West Lafayette, Ind. 

•Carver, Prof. H. C. Ph.D. (Michigan) on leave with U. S. Army Air Forces, Univ. of 
Michigan, Ann Arbor, Mich. 

Casanova, Teobaldo Ph.D. (New York) Res. Stat., Univ. of Puerto Rico, Rio Piedras, 

P. R. 

Cederberg, Prof. William £. Ph.D. (Wisconsin) Augustana Coll., Rock Island, 111. 
£649 ££i Ave. 

Chances, Ralph B.B.S. (C.C.N.Y.) Stat., Industrial Surveys Co., 347 Madison Ave., 
New York, N. Y. 

Chang, Calvin C. M.A. (Michigan) Stat., Seattle-Tacoma Shipbuilding 0>rp., Tacoma, 
Wash. 10S16 E. Marginal Way, Seattle, Wash. 

Chapman, Roy A. B.S. (Minnesota) Silviculturist, Southern Forest Exp. Sta., New 
Orleans, La. Hitchiti Experimental Forest, Round Oak, Georgia. 

Chen, Way Ming B.S. (Nat. S.W, Associated Univ., China) Dept, of Math., Univ. of 
Calif., Berkeley, 4, Calif. 

Churchman, C. West Ph.D. (Pennsylvania) Math. Frankford Arsenal, Ordnance Lab., 
Philadelphia, 37, Pa. 

Clark, Asso. Prof. A, G. A.M. (Colorado) Colorado State Coll, of A. and M., Fort Collins, 
Colo. 

Clarkson, Asso. Prof. John M. Ph.D. (Cornell) North Carolina State Coll,, Raleigh, 
N. C. 

Clifford, Asst. Prof. Paul C. M.A. (Columbia) State Teachers Coll., Montclair, N. J.; 
Stat. (k)nsultant, Wright Aeronautical Corp., Paterson, N. J. 641 Upper Mountain 
Ave., Upper Montclair, N. J. 

Clinedinst, William O. B.S. (Carnegie Inst. Tech.) Eng., National Tube Co., Frick 
Bldg., Pittsburgh, Pa. 

Cloudman, Charles G. M.Sc. (Rhode Island State Coll.) Valuation Eng., Ebasco Serv¬ 
ices, Inc., 2 Rector St., New York, N. Y. £6 Clark St., Brooklyn, N. Y. 

Cobb, William J. Stat. Census Bureau, 4036 8th St., N.E., Washington, D. C. 

•Cochran, Prof. William G. M.A. (Cambridge) Iowa State CJoll., Ames, Iowa. 

Cody, Donald D. A.B. (Harvard) Res. Math., Indianapolis Naval Ord. Plant, Indianapo¬ 
lis, Ind. 

Coggins, Paul P. A.M. (Harvard) Personnel Methods Asst., American Tel. <k Tel. Co,, 
195 Broadway, New York, N. Y. 

Cohen, Alonzo C., Jr. Ph.D. (Michigan) Major, Maint. Div. A.S.F., Washington, 25, 
D, C. 

Cohen, Jozef B. B.S. (Chicago) Sage Fellow in Psychology, Cornell Univ., Ithaca, N, Y. 

Cohen, Melvin S. B.A. (Brooklyn) U. S. Army. 1£16 Avenue N, Apt, 6D, Brooklyn, 
N. Y. 
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Coluiy lBad(»*e A. M.A. (Montclair State Teachers Coll.) 8t&t, 10 Ann 8 t, Dover, N. J. 

Coleman, Bdward P. M.S. (Iowa) Major, CAC, U. S. Military Academy, West Point, 
N. Y. 

Cope, Asso. Prof. T. Freeman Ph.D. (Chicago) Queens Coll., Flushing, N. Y. 

^Copeland, Prof. Arthur H. Ph.D. (Harvard) Univ. of Michigan, Ann Arbor, Mich. 
616 Oswego St, 

Cotterman, Charles W. Ph.D. (Ohio State) Res. Asso., Lab. of Vertebrate Biology, Univ. 
of Michigan. On leave for army serv. 6S7 Hawthorne Rd.^ Winston-Salem^ N, C, 

Court, Louis M. M.A. (Cornell) 1620 Crotona Park E., New York, 60, N. Y. 

Cowan, Donald R. G. Ph.D. (Minnesota) Consulting Economist, 2920 Broxton Rd., 
Shaker Heights^ Cleveland, 20, Ohio. 

Cox, Gerald J. Ph.D. (Illinois) Res. Chem., Corn Prod. Refining Co,, Argo, Ill. 200 
South 7th Ave,, La Orange, III. 

*Cox, Prof. Gertrude M. M.S. (Iowa State Coll.) Head, Dept, of Exp. Stat., North 
Carolina State Coll., Raleigh, N. C. 

*Craig, Allen T. Ph.D. (Iowa) Asso. Prof., Univ. of Iowa, Iowa City, la. Postgraduate 
School, U, S. Naval Academy, Annapolis, Md. 

♦Craig, Prof. Cecil C. Ph.D. (Michigan) Univ. of Michigan, Ann Arbor, Mich. 6020 
Angell Hall. 

♦Crathome, Prof. A. R. Ph.D. (Gottingen) Univ. of Illinois, Urbana, Ill. 

Crawford, Mrs. Elizabeth S. B.A. (Mundelein Coll.) Asso. Labor Market Analyst, War 
Manpower Commission. 965 Lincoln St., Denver, 6, Colo. 

Crawford, James R. Dept. Mgr., Master Scheduling, Lockheed Aircraft Corp., Burbank, 
Calif, 11626 Kiitridge St., N. Hollywood, Calif. 

Crosby, Edwin L. Dr. P. H. (Johns Hopkins) Asst. Dir., Johns Hopkins Hosp., Asso. in 
Biostatistics and Preventive Medicine, Johns Hopkins Univ., Baltimore, 5, Md. 

Crump, S. Lee B.S. (CJorncll) Res. Asso., Iowa State C>)11., Stat. Lab., Ames, Iowa. 

Cudmore, Sedley A, M.A. (Oxford) Dominion Stat., Dominion Bur. of Stat., Ottawa, 
Canada. 

Cureton, Edward E. Ph.D. (Columbia) Chief, Civilian Personnel Res., The Adj. Gen. 
Office, War Dept., 270 Madison Ave., New York, 16, N. Y. 162 Larchmont Ave., 
Larchmont, N. Y. 

Curry, Prof. Haskell B. Ph.D. (Gottingen) Pennsylvania State ColL, State College, Pa.; 
Math. Applied Physics Lab., Johns Hopkins Univ., 8621 Georgia Ave., Silver Spring, 
Md. 

♦Curtiss, John H. Ph.D. (Harvard) Lieut. USNR, Code 333, Bur. of Ships, Navy Dept., 
Washington, D. C. 

♦Daly, Joseph F. Ph.D. (Princeton) Lieut. USNR, Academic Aide to the Conunanding 
Officer, USNR Midshipmen’s School, New York, 27, N. Y. 

Dantzig, George B. M.A. (Michigan) Chief, Combat Analysis Branch, Stat. ([Control 
Div., Hdq. AAF, Pentagon Bldg., Washington, D. C. 6241 21st North, Arlington, Va. 

Darkow, Marguerite D. Ph.D. (Chicago) Asso. Prof. Math. Himter Coll. 16 E. 82nd 
St., New York, 28, N. Y. 

Day, Besse B, M.A. (Michigan) Stat., Applied Physics I^b., Johns Hopkins Univ,, 
Silver Spring, Md. 6166 Dumbarton Ave., Washington, D. C. 

De Castro, Prof. Lauro S, V. C. E. (Escola Nacional de Enginharia) Catholic Univ., Rio 
de Janeiro, Brazil. 62 Rua David Campista. 

Deemer, Walter L., Jr. Ed.D. (Harvard) Major, Hdq. Air Forces Training Command, 
Fort Worth, Texas. 

De Lury, Asst. Prof. Daniel B. Ph.D. (Toronto) Univ. of Toronto, Toronto, 5, Canada. 

♦Deming, W. Edwards Ph.D. (Yale) Chief Math,, Bur. of the Budget, Washington, 25, 
D. C. 



DetiMii, Aitt. Prof. Pmd M« D,8e, (Johns Eof^dns) Vs^derbilt Uniir, Sobohli 

Nashville, 4, Tenn, 

Dietxold, Robert L. S.B. (Mass. Inst, Tech.) Member Tech. Staff, Bell Tel. Labs., 468 
West St., New York, N.Y. 

*Dieolefait, Prof, Carlos B. Dir., Institute Estadistioa, Univ. Nacional del titoral, Pueyr- 
ledon 1235, Hosario, Argentina. 

Di Salvatore, Philip M.A. (Princeton) U. S. Army. 1 B4 Indian Run Parkvoay N., 
Union, N. J. 

Dlvatia, M. V, Dept, of Industries A Civil Supplies, New Delhi, India. 

Dlx, Margaret J. M.A. (Rice Inst.) Secretary, Univ. of California, Berkeley, 4, (2alif. 

Stat, Lab,, Univ, of Cal,, Berkeley, 4, Calif, ' . ^ 

Dixon, Asst. Prof. Wilfrid J. Ph.D. (Princeton) Res. Math., Princeton Univ., Fine Hall, 
Princeton, N. J. On leave from Univ. of Okla. 

'^Dodge, Harold F. A.M. (Columbia) Quality Results Eng., Bell Tel. Labs., Inc., 468 
West St., New York, N. Y. 

'"Doob, Asso. Prof. Joseph L. Ph.D. (Harvard) Univ. of Illinois, Urbana, Ill. Oxford 
Rd,, Glen Echo, Md, 

Dorfman, Robert M.A. (Columbia) Operations Analyist, U. S. War Dept., 3rd Opera¬ 
tions, Analysis Sect., Far Eastern Air Force, APO 926, c/o Postmaster, San Francisco, 
Calif. 

Dom, Harold F* Ph.D. (Wisconsin) Major, War Dept., Washington, D. C. IS Burning 
Tree Court, Betheada H, Md. 

Dorweller, Paul B.S. (Iowa) Actuary, Aetna Casualty A Surety Cp., Hartford, Conn. 
Dresch, Francis W. Ph.D. (California) Lt. Com. USNR, Naval Proving Grounds, 
Dahlgren, Va. 

Dressel, Asst. Prof. Paul L. Ph.D. (Michigan) Director of Counseling, Chairman, Board 
of Examiners, Mich. State Coll., East Lansing, Mich. 

Dunlap, Jack W. Ph.D. (Columbia) Lt. Com. U. S. Navy. University of Rochester, 
Rochester, N. Y. 

Durand, David Ph.D. (Columbia) Lieut. (S.G.) USNR, U. S. Navy. 

Dutka, Jacques Ph.D. (Columbia) Asst. Res. Math., Columbia Univ., Div. of War 
Res, New York, N. Y. 740 Gerard Ave., Bronx, N, Y, 

Duvall, George E. Asst. Physicist, Univ. of California, Div. of War Res., USN Radio A 
Sound Lab., San Diego, 62, Calif. 

*Dwyer, Asso. Prof. Paul S. Ph.D. (Michigan) Univ. of Michigan, Ann Arbor, Mich. 
2509 James Si. 

Edgett, Asst. Prof. G. L. Ph.D. (Illinois) Queen's Univ., Kingston, Ont., Canada. 
Edwards, George D. A.B. (Harvard) Dir. Quality Assurance, Bell Tel. Labs., Inc., 463 
West St., New York, 14, N. Y. 

""Eisenhart, Churchill Ph.D, (London) Univ. of Wisconsin, Madison, Wis.; Stat. Res. 

Group, Div. of War Res., (k)lumbia Univ., New York, 27, N. Y. 

Elconin, Victor M.S. (Cal. Inst. Tech.) Asso. Physicist, Calif, Inst, Tech.; Tech. Dir. 
Airtronics Mfg. Co.; Tech. Dir. Electronic Industries Tech. Inst., Los Angeles, Calif. 
740 Cordova Ave., Glendale, 6, Calif. 

Eldredge, George G. Ph.D. (Minnesota) Res. Chem. Eng., Aluminum Co. of America, 
Aluminum Res. Labs., New Kensington, Pa. 

Elkin, William F. M.S.P.H. (Michigan) Social Science Analyst, Vital Stat. Div., U. S. 

Bur. of the Census, Washington, 26, D. C. $11 S. Juniper St., Philadelphia 7, Pa, 
Elkins, Thomas A. A.M. (Princeton) Geophysicist, Gulf Res. A Dev. Co., P. O. Drawer 
2638, Pittsburgh, Pa. $255 Parkview Ave., Pittsburgh, IS, Pa, 

Bills, Wade Ph.D. (Michigan) Spec. Instr. in Math., Univ, of Michigan. 921 Woodlawn, 
Ann Arbor, Mich. 
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Slttoiii Imum 8* A.B* (Comell) Asst. ActuaiTi Travelers Ins. Go.| Hartford^ 

Sltlnft John P. M.S. (Mass. Inst. Tech.) Dir. of Res., Kendall Mills, Paw Creek, N. C. 

BlToback, Mary L« M.A. (Minnesota) Instr. Michigan State Coll., East Lansti^, Midh. 

Bodey, Hark A.B. (California) Lecturer in Math and Res. Asst, in Stat. Xab,, XJniv. of 
Galifornia, Berl^ley, Calif. On leave from Univ. N-008031, H. Q. 9th Tact. Air 
Comd., O.R.S., A.P.O. 595, New York, N. Y. 

Bvansy Prof. Herbert P. Ph.D. (Wisconsin) North Hall, Univ. of Wisconsin, Madison, 
6, Wise. 

Bvans, Wilmoth D. B.S. (Clarkson Coll. Tech.) Chief, Productivity Sc Technological 
Dev. Div., U. S. Bur. of Labor Stat., Washington, 25, D. C. 

Evers, Dillon Ph.D. (Iowa) Capt. Ord. Dept., St. Louis Ordnance Plant, 43(X) Good- 
fellow Blvd., St. Louis, 20, Mo. 

Fadner, Raymond H. B.A. (Minnesota) Stat., United Nations Relief & Rehab. Admin., 
Dupont Circle Bldg., Washington, D. C. 17S6 Oee SL, N,W, 

Fattu, Nicholas A. Ph.D. (Minnesota) College Entrance Exam. Board, Princeton, N. J. 
B04 W, Cedar Chippewa Falls, Wis, 

Feldman, Hyman M. Ph.D. (Washington) Beaumont High School, St. Louis, Mo. 

Fellppe, Jos5 G. Pres. Nat. (Densus Commission. Trav, Sta Terezinha, 55, Rio de Janeiro, 
Brazil. 

*Feller, Asso. Prof. Will Ph.D. (Gottingen) Brown Univ., Providence, R. I. 

Ferrell, Enoch B. M.A. (Oklahoma) Member Tech. Staff, Bell Tel. Labs., 463 West St., 
New York, N. Y. 75 Fuller Ave., Chatham, N. J. 

Ferris, Charles D. A.B. (Princeton) Sgt., Surveillance Branch, Ballistic Res. Lab., 
Aberdeen Prov. Grounds, Md. 

Fertlg, Prof. John W. Ph.D. (Minnesota) Biostatistics, DeLamar Inst, of Pub. Health, 
(i)lumbia Univ., 600 W. 168th St., New York, 32, N. Y. 

Fetters, Karl L. D.Sc. (Mass. Inst. Tech.) Asst, to Vice-Pres., Youngstown Sheet & 
Tube Co., Stambaugh Bldg., Youngstown, Ohio. 

Field, Robert W. Ph.D. (Illinois) Asso. Prof, of Industrial Eng., Purdue Univ., La¬ 
fayette, Ind. 

File, Quentin W. Ph.D. (Purdue) Labor Utilization Analyst, Wright Aeronautical Corp., 
Cincinnati, Ohio. 55 Gahl Terrace, Reading, 15, Ohio. 

Fischer, Asst. Prof. Carl H. Ph.D. (Iowa) Univ. of Michigan, 3016 Angell Hall, Ann 
Arbor, Mich. 

*Flsher, Prof. Irving Ph.D. (Yale) Prof. Emeritus of Economics, Yale Univ., Now 
Haven, 8, Conn. 

Fix, Bvel3m M.A. (Minnesota) Lecturer in Math. & Res. Asst., Stat. Lab., Univ. of 
California, Berkeley, Calif. 

Flaherty, Asst. Prof. William C. A.B. (Georgetown) Georgetown Univ., Washington, 
D. C. 

Flood, Merrill M. Ph.D. (Princeton) Res, Asso. in Math., Princeton Univ., Princeton, 
N. J. 20 Nassau St. 

Foster, Prof. Ronald M. S.B. (Harvard) Head of Math. Dept., Polytechnic Inst, of 
Brooklyn, Brooklyn, N. Y, 122 E. Dudley Ave., Westfield, N. J. 

Fox, Asso. Prof. Phillip G. A,M. (Wisconsin) 403 Sterling Hall,.Univ. of Wisconsin, 
Madison, Wis. 

Frankel, Lester R. M.A. (Columbia) 1538 E. 27th St., Brooklyn 29, N. Y. 

Fraser, Andrew Jr. M.A. (George Washington) Prin. Admin. Officer, Control Div., 
Office of the Chief, Signal Office, War Dept., Washington, D. C. 6441 Wiscasset Road, 
Glen Echo Heights, Md., Washington, 16, D. C, 

Freeman, Albert M. Dir. Math. Lab., Boston Fiduciary & Research Associates. 8 Park- 
side Rd., Providence 6, R. I. 



Aaio« Hftr<^d A. S3. (Mm. Inst. Teih.) K£utt. Inst, df ITstoH., 
bridge^ Mass. 

Frewad^ JbhnE. M.A. (U.C.L.A.) G»d, Siudeat, 3Box 4221,1/w Angsles/H 

^Friadjnan* MUton M.A. (Chicago) Stat. Hes. Group, Div. of War Res., Coltanbia tJniv,, 
401W. 118 St., New York, 27, N. Y. 467 Central Park Weet, New York, U, N. Y. 

♦Fry, Thornton C. Ph.D. (Wisconsin) Bell Tel. Labs. Inc., 468 West St., New York, 14, 
N.Y. 

Fryer, Asso. Prof. HoUy C. Ph.D. (Iowa State Coll.) Kansas State Coll., Manhattan 
Kan. On leave—Res. with Applied Math Panel, 401W. 118 St., New York^ 27, N. Y. 

Gage, Robert P. M.S. (Iowa State Coll.) Asso. Div. of Biometry A Medical Stat., Mayo 
Clinic, Rochester, Minn. 

Gause, G. Rupert B.S. (Citadel) Sr* Ord. Eng., Office of Chief of Ord., Washington, D. 
C. 1604 Peach SL, Alexandria, Va. 

*Geirlnger, Prof. Hilda P. Ph.D. (Vienna) Head, Math. Dept., Wheaton Coll., Norton, 
Mass. 

Gerlough, Daniel L. B.S. (Calif. Inst, of Tech.) Quality Control Eng., Plomb Tool Co.* 
Box 3519 Terminal Annex, Los Angeles, 54, Calif. 

Germond, Hallett H. Ph.D. (Wisconsin) Asso. Prof, of Math., Univ. of Florida, Gaines* 
ville, Fla. (On leave) 

Ghonnley, Glen £. Stat. Analyst, Lockheed Aircraft Corp., Burbank* Calif. 

Gibson, Robert W. Ph.D. (Illinois) WaKeeney, Kansas. 

Gifford, Kenneth R. Student, Mass. Inst, of Tech., Cambridge, Mass. 97 Bay State Rood, 

Gill, John P. M.A, (Alabama) Stat., Dept, of Res. A Stat,, Fed. Reserve Bank, Dallas, 
Texas. 

Gintzler, Leone Belle M.A. (California) Standard Oil Co. of Calif., 225 Bush St., San 
Francisco, Calif. 2S65 Slat Ave., San Francisco, 16, Calif, 

^Girshlck, Meyer A, M.A. ((k>Iumbia) Div. of Stat. A Hist. Res., Bur. of Agric. Eco¬ 
nomics, Washingt^, D. C. 

Godfrey, Edwin L. A.M. (Indiana) Instr. Math. A Physics, The Northwest Missoiiri 
State Teachers Coll., Maryville, Mo. 

Goldberg, Henry M.A. (dblumbia) Asst. Math. Stat., SRG, Columbia Univ., 401 W. 
118th St., New York, 27, N. Y. 

Goode, Harry H. B.S. (New York Univ.) Res. Asso. in Math., Tufts Coll., Medford, 55, 
Mass. 3251 Cambridge Ave,, New York, 63, N, Y, 

Gordon, Chester H. B.S. (Bethany) Res. Asst., Mass. Inst, of Tech., Cambridge, Mass. 

Gordon, Donald A. A.M. (Columbia) Asst., Columbia Univ., Morningside Heights, New 
York, N. Y. Psychology Dept,, c/o Prof, C, J, Warden, 

Gordon, Robert D. A.M. (Stanford) Div. of War Res., Columbia Univ., New York, N. Y. 

Gottfried, Bert A. M.A. (Columbia) U. S. Army, OQM, Hq., UK. Base, APO lf^871, c/o 
Postmaster, New York, N. Y. 

Gottfried, Mrs. Dorothy K. A.B. (Hunter) Res. Analyst, U. S. Army, Signal Corps, 
Newark, N, J. 6035 Boulevard E,, West New York, N, J, 

Grant, Eugene L. A.M. (Columbia) Prof, of Economics of Engineering, Stanford Univ., 
Dept, of Civil Eng., Stanford Univ., Calif. 

Graves, Asst. Prof. Clyde H. Ph.D. (Chic^) Pennsylvania State Coll., State College, 
Pa. 700 N, Wayne St,, Arlington, Va, 

Greenleaf, Prof. Herrick E. H. Ph.D. (Indiana) DePauw Univ., Greencastle, Ind. 
1024 College Ave,, Greencastle, Ind. 

Greenwood, Jose^ A. Ph.D. (Missouri) Lieut. U.S.N.R., U. S. Navy, Armament Test, 
N.A.S. Patuxent River, Md. 

Greville, Thomas N. Ek Ph.D. (Michigan) Actuarial Math., Bur. of the Census, Wash¬ 
ington, 25, D. C. 1714 34th St,, N,W,, Washington, 7, D, C, 
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GtfffBn, John 1* Ph.D. (Columbia) Instn Coll, of the City of Ne# York and Long Islaiid 
Univ., New York, N. Y. 115 Henry St, Brooklyn, N, Y, 

Grlffitta, F^f. C. H. Fh.D. (Michigan) Univ. of Michigan, Ann Arbor, Mich. 1507 
Charlton Ave. 

Oroth, Alton O. M.S. (Iowa) Asst. Actuary, Equitable Life Ins. Co. of Iowa, Des Moines, 

6, Iowa. 

Grove, Asst. Prof. Charles C. Ph.D. (Johns Hopkins) C!oll. of the City of New York, 
New York, N. Y, 14S Milbum Ave», Baldwin, L. N. Y. 

Grubbs, Frank £. M.A. (Michigan) Major, Ballistic Res. Lab., Aberdeen Proving 
(jlouDd, Md. 

Guard, Asst. Prof. Harris T. M.S. (Colorado) Instr. Dept, of Math., USMA, West 
Point, N. Y. 

Guilford, Prof. J. P. Ph.D. (Cornell) Univ. of Southern California, Los Angeles, Calif.; 
Lieut. Col., Army Air Force, Dir. Psych. Res. Unit $3^ Santa Ana Army Air Base, 
Santa Ana, Calif. 509 No, Rexford Dr., Beverly Hills, Calif. 

•Gumbel, Prof. Emil J. Ph.D. (Munich) Visiting Prof. Grad, faculty New School for 
Social Res., 66 W. 12th St., New York, N. Y. S890 Waldo Ave., New York, 6S, N. Y. 
Gunlogson, L. S. B.B.A. (Minnesota) Ensign, U. S. Navy, USS C. K. Bronson (DD668), 
FPO San Francisco, Calif. SS05 Cedar Ave. 

Gurland, John M.A. (Toronto) Demonstrator, Univ. of Toronto, Toronto, Can. 
Admiral Rd. 

Gurney, Margaret Ph.D. (Brown) Economist, Bur. of the Budget, Washington, D. C. 
610B Lombard St., Cheverly, Hyatisville, Md. 

Guttman, Asst. Prof. Louis Ph.D. (Minnesota) (Cornell Univ., 304 McGraw Hall, Ithaca, 
N. Y. 

Gutzman, Wayne W. Ph.D. (Iowa) Ensign, USNR, Naval Proving Gd., Dahlgren,Va. 
Haavelmo, Trygve Cand. Gecon. (Oslo) Ck)mmercial Sec., Norwegian Embassy, 3401 
Mass. Ave., N.W., Washington, D. C. 

Hadley, Clausin D. Ph.D. (Wisconsin) Stat., Eli Lilly & Co., 740 S. Alabama St., In- 
dianpolis, 6, Ind. 

Hagood, Mrs. Margaret J. Ph.D. (North Carolina) Sr. Social Scientist, Bur. of Agric. 

Economics, U. S. Dept, of Agric., Washington, D. C. 

Haines, Harold M.S. (Now York) Res. Asst., Burndy Eng. Co., 459 E. 133 St., New 
York, N. Y. 65-65 Booth Si., Forest Hills, L. /. N. Y. 

Halbert, K. W. A.M. (Harvard) Stat. Amer. Tel. & Tel. Co., 195 Broadway, New York, 

7, N. Y. 

Hall, Asst. Prof. Marguerite F. Ph.D. (Michigan) Pub. Health Stat., Univ. of Michigan, 
Ann Arbor, Mich. B5 Ridgeway. 

Halmos, Asst. Prof. Paul R. Ph.D. (Illinois) Syracuse Univ., Syracuse, 10, N. Y. 
Hamilton, Prof. Thomas R. Ph.D. (Columbia) Texas A. and M., (College Station, Tex. 
Box S04, Faculty Exchange. 

Hammer, Hans-Karl Ph.D. (Munich) Pillsbury Mills, Inc., 21 West St., New York, 6, 
N. Y. 304 W. 105th St, New York, 35, N. Y. 

Hammer, Asst, Prof. Preston C. Ph.D. (Ohio State) Oregon State CJoll. Corvallis, Ore. 
Hand, Howard J. B.S. (Carnegie Inst. Tech.) Analytical Eng.,‘ National Tube Co., 
Frick Bldg., Pittsburgh, Pa. 135 Bayard Place, Pittsburgh IS, Pa. 

^Hansen, Morris H. M.A. (American) Stat. Asst, to the Dir., Bur. of the Census, Wash¬ 
ington, D. C. 513 Qoddard Rd., Bethesda, Md. 

Harold, Miriam S. B.A. (Hunter) Member of Tech. Staff, Bell Tel. Labs., Murray Hill, 
N. J. 19 Hillside Ave,, Chatham, N, J, 

Harshbarger, Prof. Boyd Ph.D. (George Washington 1 Virginia Polytechnic Inst., and 
Virginia Agric. Exp. Sta., Blacksburg, Va. 

Hart, Alex L. Ph.D. (Minnesota) Asst. Dir. Res. A Planning Dept., Eastern Air Lines, 
Inc., 10 Rockefeller Plaza, New York, N. Y. 141-34 79th Ave., Flushing, L. I., N. Y. 
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Hart, Prof. WlUUm 1. Ph.D. (Chicago) Univ, of MiimeBota, Minneapolis, Mh^« 

Hasel, Austin A. B.S.F. (Michigan) Silviculturist, California Forest and Itange fixp. 
Sta., 831 Giannini Hall, Berkeley, Calif. 

Hatke, Sister M. Agnes M.S. (Purdue) St. Francis College, 2701 Spring Street/iFort 
Wayne 8, Ind. 

Head, George A. B.E.E. (New York) Tech. Staff, Bell Telephone Labs., Inc., Murray 

Hebley, Henry F. Product Control Mgr., Pittsburgh CJoal Ck>., Oliver Bldg., Pittsburgh, 
Pa. 

Heczko, Leah Naugle A.M. (Michigan) Stat., Dept, of Labor & Industry, State Bldg., 
Lansing, Mich. 
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